

Philosophical Magazine


FIRST PUBLISHED IN 1798 


A Journal of Theoretical 


Experimental and Applied Physics 


Vol. 3 September 1958 No. 33 
Eighth Series 


£1 5s. 0d., plus postage
Annual Subscription £13 10s. 0d., payable in advance




Printed and Published by 


TAYLOR & FRANCIS LTD 
MELLON COURT, FLEET STREET, LONDON, E.C.4 


THE PHILOSOPHICAL MAGAZINE 


Editor 
Professor N. F. Mott, M.A., D.Sc., F.R.S.


Editorial Board 


Sir Lawrence Bragg, O.B.E., M.C., M.A., D.Sc., F.R.S.
Sir George Thomson, M.A., D.Sc., F.R.S.
Professor A. M. Tyndall, C.B.E., D.Sc., F.R.S.


Authors wishing to submit papers for publication in the Journal should
send manuscripts directly to the Publishers. 


Manuscripts should be typed in double spacing on one side of quarto 
(8x10in.) paper, and authors are urged to aim at absolute clarity of 
meaning and an attractive presentation of their texts. 


References should be listed at the end in alphabetical order of authors 
and should be cited in the text in terms of author’s name and date. Diagrams
should normally be in Indian ink on white card, with lettering in
soft pencil, the captions being typed on a separate sheet. 


A leaflet giving detailed instructions to authors on the preparation of papers 
is available on request from the Publishers. 


Authors are entitled to receive 25 offprints of a paper in the Journal free 
of charge, and additional offprints can be obtained from the Publishers. 


The Philosophical Magazine and its companion journal, Advances in Physics, 
will accept papers for publication in experimental and theoretical physics. 
The Philosophical Magazine publishes contributions describing new results, 
letters to the editor and book reviews. Advances in Physics publishes articles 
surveying the present state of knowledge in any branch of the science in which 
recent progress has been made. The editors welcome contributions from 
Overseas as well as from the United Kingdom, and papers may be published 
in English, French and German. 




Random Walks and Drift in Chemical Diffusion†


By A. D. Le Claire


Metallurgy Division, Atomic Energy Research Establishment, 
Harwell, Berks. 


[Received April 15, 1958] 


ABSTRACT 


In self diffusion the average displacement $\overline{X}(t)$ of an atom after a time $t$ is
zero. It is pointed out that in chemical diffusion $\overline{X}$ is not necessarily zero, for
there are several mechanisms tending to make an atom jump preferentially
in one direction rather than the other along a chemical concentration gradient,
i.e. tending to produce a drift of atoms superimposed upon their otherwise
random movement. This leads to a substantial modification of the classical
Einstein equation $D=\overline{X^2}(t)/2t$ connecting the self-diffusion coefficient $D$ with
the mean square displacement $\overline{X^2}(t)$, and the equation for chemical diffusion
which replaces it is derived. This is shown to lead, on the basis of a simple
model, to Darken’s equation for the chemical diffusion coefficient.

Although in both this and a more general model there are usually two
independent mechanisms contributing to $\overline{X}$, it turns out to be an interesting
feature of chemical diffusion that only one of these is manifest in the chemical
diffusion coefficient. The total drift $\overline{X}$ can be measured independently of the
diffusion coefficient and values calculated from the theoretical expressions
derived agree well with experimental measurements for zinc and for copper
in α-brass/copper diffusion couples.

The results are generalized for chemical diffusion in multicomponent systems
and expressions obtained for the constants in the familiar schemes of equations
used to describe multicomponent diffusion. It is concluded that the
cross terms $L_{ij}$ in the Onsager scheme are not primarily the result of correlation
between the directions of successive jumps of atoms, as has been
suggested.


§ 1. INTRODUCTION 


The atomic process responsible for the observed effects of diffusion is
generally accepted to be the Brownian motion which each atom performs
as a result of thermal agitation. In particular, in a solid crystal, each
atom is visualized as spending only a finite time on any one lattice site,
then jumping to a neighbouring site, then to another, and so on, in this
way pursuing an endless path or ‘random walk’ throughout the crystal.
In substitutional solid solution the jumps are most probably made into 
vacant lattice sites (vacancy diffusion) and a particular atom only has an 
opportunity to jump when a vacant site becomes adjacent to it as a result 
of the movements of the other atoms. In interstitial solid solution the 
rate of jumping of solute atoms is not limited by the availability of vacant 
sites, at least in dilute solutions. 

The only features of such a ‘random walk’ which are usually accessible to
experiment are statistical quantities like $\overline{X}(t)$ and $\overline{X^2}(t)$, the average nett
displacement and the average square of the nett displacement $X$ of an atom
after a time $t$, measured along a given direction, the $x$-axis, the averages
being taken over a very large number of identical particles. For example,
there exists the simple and well known Einstein (1905) relation between
$\overline{X^2}(t)$ and the diffusion coefficient $D$, which is measured by determining
the nett flux of atoms across a plane arising from their random movements
when there exists a concentration gradient of the atoms:

$$D=\tfrac{1}{2}\,\overline{X^2}/t. \tag{1}$$

† Communicated by the Author.


However, as we shall discuss shortly, this equation is valid only for 
random walks which take place in chemically homogeneous systems (Self 
Diffusion). It is the purpose of this paper to discuss the problem of 
random walks in systems with a chemical concentration gradient (Chemical 
Diffusion) and to derive the equation which replaces (1) in this case. 
Although the relation between chemical diffusion coefficients and self-diffusion
coefficients has been discussed in the past, largely in thermodynamic
terms (Darken 1948, Bardeen 1949, Bardeen and Herring 1951, Le Claire
1953), little attention seems to have been given to a treatment in terms of
the statistical features of the associated random walk. We shall find
that although the well known Darken equation follows again from such an
approach there is the further interesting result that in a chemical concentration
gradient atoms suffer a nett displacement $\overline{X}$, only part of which is
made manifest in a chemical diffusion experiment.

We begin by outlining a derivation of eqn. (1) which is then adapted to 
the chemical diffusion case. 


§ 2. SELF DIFFUSION


Consider a crystal in which there exists along the $x$ direction a concentration
gradient of radioactive atoms of one species but which is otherwise
homogeneous chemically. We wish to know the nett rate of transfer of
atoms across some reference plane $x_0$, arising from their random motion.

Consider an infinitesimal layer $dx_1$ at $x_1$. At $t=0$ each atom within
this layer begins a random walk. Let $f(X,t)$ be the relative probability
that in time $t$ an atom will have migrated a distance $X$ from $x_1$, measured
along the $x$-axis. Since the medium is homogeneous chemically, migrations
of $+X$ and $-X$ are equally probable, i.e. there is no tendency for an
atom to drift preferentially in one direction or the other, so that the average
displacement after $t$ of a large number of atoms is zero:

$$\overline{X}=\int_{-\infty}^{+\infty}X\,f(X,t)\,dX=0. \tag{2}$$

Similarly, all odd moments of $f(X,t)$, $\overline{X^3}$, $\overline{X^5}$ etc., are zero.


But the average squared displacement $\overline{X^2}$ is not zero and after $t$ all the
radioactive tracer atoms originally within the layer $dx_1$ will be spread
out and distributed in a manner indicated by curve I, symmetrical about
$x_1$ because $\overline{X}=0$. (The curve can readily be shown to be Gaussian, as
drawn, but its precise shape does not concern us here.) The area under
the curve is $c(x_1)\,dx_1$, $c(x)$ being the tracer concentration at $x$, and being
Gaussian its half width is $(\overline{X^2})^{1/2}$. A similar curve II represents the
distribution at $t$ of those tracer atoms originally contained in an infinitesimal
layer at $x_2$, an equal distance on the right-hand side of $x_0$. Its area is
$c(x_2)\,dx_2$ but its width is the same as that of curve I.

The shaded area A represents the number of tracer atoms from $x_1$ which
are on the right-hand side of $x_0$ after $t$ and area B the number from $x_2$ now
on the left-hand side of $x_0$. Because A>B there has been a nett flow of
atoms across $x_0$ and in the direction of decreasing concentration: the
total nett flow is to be got by summing the differences A−B for all pairs
of regions like $dx_1$ and $dx_2$.


Fig. 1 


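The total number of tracer atoms originally on the left-hand side of
$x_0$ which will be found on the right-hand side after $t$ is

$$\int_{-\infty}^{x_0}c(x)\left\{\int_{x_0-x}^{\infty}f(X,t)\,dX\right\}dx. \tag{3}$$

The total integrand represents an area such as A. Similarly, the total
number of atoms which have crossed $x_0$ from right to left, i.e. the sum of
areas like B, is

$$\int_{x_0}^{\infty}c(x)\left\{\int_{-\infty}^{x_0-x}f(X,t)\,dX\right\}dx. \tag{4}$$

The difference between (3) and (4) gives the nett transfer across $x_0$
from left to right. Expanding $c(x)$ about its value at $x_0$,

$$c(x)=c(x_0)+(x-x_0)\frac{\partial c}{\partial x}+\tfrac{1}{2}(x-x_0)^2\frac{\partial^2 c}{\partial x^2}+\ldots \tag{5}$$

and substituting in (3) and (4), the difference between them becomes

$$
\begin{aligned}
&c(x_0)\left[\int_{-\infty}^{x_0}\left(\int_{x_0-x}^{\infty}f\,dX\right)dx-\int_{x_0}^{\infty}\left(\int_{-\infty}^{x_0-x}f\,dX\right)dx\right]\\
&+\frac{\partial c}{\partial x}\left[\int_{-\infty}^{x_0}(x-x_0)\left(\int_{x_0-x}^{\infty}f\,dX\right)dx-\int_{x_0}^{\infty}(x-x_0)\left(\int_{-\infty}^{x_0-x}f\,dX\right)dx\right]\\
&+\frac{1}{2}\frac{\partial^2 c}{\partial x^2}\left[\int_{-\infty}^{x_0}(x-x_0)^2\left(\int_{x_0-x}^{\infty}f\,dX\right)dx-\int_{x_0}^{\infty}(x-x_0)^2\left(\int_{-\infty}^{x_0-x}f\,dX\right)dx\right]+\ldots
\end{aligned}
\tag{6}
$$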
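Integrating by parts twice and dividing by $t$ to give the average nett
rate of flow $J$, we find

$$J=\frac{1}{t}\left\{c(x_0)\,\overline{X}-\frac{\partial c}{\partial x}\frac{\overline{X^2}}{2}+\frac{\partial^2 c}{\partial x^2}\frac{\overline{X^3}}{6}-\ldots\right\}. \tag{7}$$

Since for self-diffusion $\overline{X}$, $\overline{X^3}$ etc. are zero, this becomes Fick’s law

$$J=-\left(\frac{\overline{X^2}}{2t}\right)\frac{\partial c}{\partial x} \tag{8}$$

and hence the relation (1) for $D$. We note that terms of order $\partial^3c/\partial x^3$ and
higher are being ignored in order to obtain $D$.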
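The step from the crossing integrals (3) and (4) to Fick's law can be checked numerically. The following is a minimal sketch, not part of the original derivation: it assumes a Gaussian $f(X,t)$ of standard deviation `sigma` and a linear concentration profile (both illustrative choices, as are all function names), and compares the nett transfer with $-(\partial c/\partial x)\,\overline{X^2}/2$.

```python
import math

def gaussian_tail(u, sigma):
    """P(X > u) for a zero-mean Gaussian f(X, t) of standard deviation sigma."""
    return 0.5 * math.erfc(u / (sigma * math.sqrt(2.0)))

def net_transfer(conc, x0, sigma, n=20000, cutoff=12.0):
    """Expression (3) minus expression (4), evaluated by the midpoint rule.

    conc  : callable c(x), an assumed tracer concentration profile
    x0    : position of the reference plane
    sigma : root mean square displacement after time t
    """
    h = cutoff * sigma / n
    total = 0.0
    for k in range(n):
        u = (k + 0.5) * h                   # distance of the starting layer from x0
        crossing = gaussian_tail(u, sigma)  # same on both sides because f is symmetric
        total += (conc(x0 - u) - conc(x0 + u)) * crossing * h
    return total

gradient = 3.0   # assumed dc/dx
sigma = 0.5      # assumed (mean X^2)^(1/2)
flow = net_transfer(lambda x: 10.0 + gradient * x, x0=0.0, sigma=sigma)
expected = -gradient * sigma ** 2 / 2.0     # -(dc/dx) * mean(X^2) / 2
print(flow, expected)
```

Dividing `flow` by the elapsed time gives the flux of eqn. (8); the constant part of the profile drops out because equal numbers of atoms cross in each direction.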

If $\Gamma$ is the average number of jumps made by an atom in unit time, the
total number of jumps after time $t$ is $N=\Gamma t$. Let $x_j$ denote the $x$ component
of the $j$th jump of an atom. Then

$$\overline{X^2}=\overline{\Big(\sum_{j=1}^{N}x_j\Big)^2}=\sum_{j=1}^{N}\overline{x_j^2}+\sum_{j}\sum_{k\neq j}\overline{x_jx_k}. \tag{9}$$

Since the averages are over a large number of atom paths the last term will
be zero, if we ignore possible correlations between the directions of successive
jumps (Bardeen and Herring 1951, Compaan and Haven 1956, Le
Claire and Lidiard 1956), for positive and negative products will occur
with equal frequency. Suppose now that when an atom makes a jump
it has a choice of $s$ directions in which it can move. Let $\Gamma_i$ be the rate at
which atoms make jumps in the direction $i$, so that $\Gamma=\sum_{i=1}^{s}\Gamma_i$, and let $a_i$ be
the $x$ component of the corresponding jump vector. Then

$$\overline{X^2}=\sum_{j=1}^{N}\overline{x_j^2}=N\,\overline{x_j^2}=N\sum_{i=1}^{s}\frac{\Gamma_i}{\Gamma}a_i^2=t\sum_{i=1}^{s}\Gamma_ia_i^2, \tag{10}$$

so that we have for $D$

$$D=\tfrac{1}{2}\sum_{i=1}^{s}\Gamma_ia_i^2. \tag{11}$$

In self diffusion the $\Gamma_i$ are all equal in cubic crystals but not necessarily so
in anisotropic crystals. For cubic crystals we may therefore write

$$D=\tfrac{1}{2}\Gamma_i\sum_{i=1}^{s}a_i^2=\tfrac{1}{2}\Gamma\,\overline{x_j^2}, \tag{12}$$

where in the last expression $\overline{x_j^2}$ is now an unweighted mean.
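As an illustration of eqns. (11) and (12), the sketch below evaluates $D$ from a set of directional jump rates. The simple cubic lattice, the numerical rates and the function name are assumptions chosen for the example and do not come from the text.

```python
def diffusion_coefficient(jump_rates, x_components):
    """Eqn (11): D = 1/2 * sum_i Gamma_i * a_i**2."""
    return 0.5 * sum(g * a * a for g, a in zip(jump_rates, x_components))

# Assumed example: simple cubic lattice, six nearest-neighbour jumps of length a,
# only two of which have a component along the x-axis.
a = 1.0
gamma_total = 6.0
rates = [gamma_total / 6.0] * 6
x_parts = [a, -a, 0.0, 0.0, 0.0, 0.0]

d_from_rates = diffusion_coefficient(rates, x_parts)
mean_sq_jump = sum(x * x for x in x_parts) / len(x_parts)  # unweighted mean of a_i^2
d_from_mean = 0.5 * gamma_total * mean_sq_jump             # last form of eqn (12)
print(d_from_rates, d_from_mean)
```

Both routes give $\Gamma a^2/6$ for this lattice, as expected when all the $\Gamma_i$ are equal.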


Note added in proof. ‘Random walk’ treatments of self diffusion
have also been given, among others, by Kramers (1940) and
Chandrasekhar (1943).




§ 3. CHEMICAL DIFFUSION


It is well established that both self-diffusion coefficients and chemical 
diffusion coefficients vary with concentration so that the atomic jump 
rates must change continuously as we move along a concentration gradient. 
This means that we can no longer assume, as we did above, that the 
probability function $f(X,t)$ is a function only of $X$ and $t$. The probability
that an atom migrates a distance $X$ in time $t$ will now depend also on the
distribution of concentration throughout the whole region of the crystal
which is accessible to the atom in time $t$.

Generally, during the course of diffusion, this distribution will change
continuously so that a proper specification of $f$ for all $t$ would be very
complex. We can however consider the special case of steady state
diffusion, e.g. the steady flow of material into, through and out of a plate. 
Here, the distribution of concentration within the plate does not change 
with time when the steady state is reached. 

Under these conditions the probability that in time $t$ an atom starting
from any given position will migrate a distance $X$, in addition to being
a function of $X$ and $t$, will depend also only on the position of the starting
point and on the nature of the whole but steady distribution of concentration.
The position of the starting point we can specify by the concentration
there, so that for a random walk in a steady concentration gradient
we replace $f(X,t)$ by $f(X,t,c(x))$, which is the probability that an atom
which starts from $x$ where the chemical concentration is $c(x)$ will migrate
$X$ in time $t$. The precise form of $f$ will depend on the nature of the stationary
concentration distribution to which it refers.

$f(X,t,c(x))$ can be expanded about its value for atoms commencing
their random walk at the reference plane $x_0$,

$$f(X,t,c(x))=f(X,t,c(x_0))+(x-x_0)\frac{\partial f(X,t,c(x_0))}{\partial x}+\tfrac{1}{2}(x-x_0)^2\frac{\partial^2 f(X,t,c(x_0))}{\partial x^2}+\ldots \tag{13}$$


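We now substitute this expression for $f$ into eqn. (6), perform twice an
integration by parts and divide by $t$ to give $J$, the average rate of flow
across $x_0$. We find for $J$

$$
\begin{aligned}
J={}&c(x_0)\frac{\overline{X}}{t}-\frac{\partial c}{\partial x}\frac{\overline{X^2}}{2t}-c(x_0)\frac{\partial}{\partial x}\left(\frac{\overline{X^2}}{2t}\right)\\
&+\frac{1}{6}\frac{\partial^2c}{\partial x^2}\frac{\overline{X^3}}{t}+\left(\frac{\partial c}{\partial x}\right)^2\frac{\partial}{\partial c}\left(\frac{\overline{X^3}}{3t}\right)+\ldots
\end{aligned}
\tag{14}
$$

In this expression the moments are to be evaluated for atom paths
beginning at $x_0$.

Previously we knew from the symmetry of $f(X,t)$ that its odd moments
$\overline{X}$, $\overline{X^3}$ etc. would be zero. Whether or not this is true also for $f(X,t,c(x))$
depends on the detailed model of atomic jumps in a concentration gradient.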




In general it will not be true. Therefore, since $\overline{X}$ is to a good approximation
proportional to $\partial c/\partial x$ (see later) we arrive at Fick’s law in chemical diffusion,
that $J$ is proportional only to the concentration gradient, provided we can
ignore as being negligible terms in $\partial^2c/\partial x^2$, $(\partial c/\partial x)^2$ etc. In self-diffusion
we were obliged to ignore only terms of higher order than these. Fick’s 
law is therefore for chemical diffusion a grosser approximation than for 
self-diffusion and we might expect the law to break down first for diffusion 
in sufficiently steep chemical concentration gradients. It would be 
interesting to study this breakdown experimentally, perhaps by looking 
for a time dependence of the apparent diffusion coefficient during the 
initial stages of diffusion across an interface separating two materials 
the compositions of which differ as much as possible. It would be preferable 
to study the diffusion of an interstitial solute, for in substitutional solutions 
Kirkendall effect phenomena, particularly non-equilibrium vacancy 
concentrations, might well play a predominant part in producing a time 
dependent D (Seitz 1955). 

In what follows we shall assume that the gradient is small so that all but
the first three terms of (14) can be neglected: we can then define a chemical
diffusion coefficient as

$$D=-c(x_0)\frac{\overline{X}/t}{\partial c/\partial x}+\frac{\overline{X^2}}{2t}+c(x_0)\frac{\partial}{\partial c}\left(\frac{\overline{X^2}}{2t}\right). \tag{15}$$

This expression for $D$ replaces for chemical diffusion the classical
Einstein expression which is appropriate only for self-diffusion.

Equation (14) shows that in general the nett rate of flow across a given
plane is characterized by a drift motion of the atoms, represented by the
term in $\overline{X}$, superimposed on the otherwise random motion of the atoms,
represented by the $\overline{X^2}$ term as in self-diffusion: there is an additional
contribution arising from the change in the extent of the random motion
with concentration and therefore with position, the third term in eqn. (14).
Further discussion of (14) or (15) must be based on particular models
of the jump process and the dependence of its rate on concentration.
For this purpose we shall need to evaluate $\overline{X}$ and $\overline{X^2}$.
$\overline{X}$ is given by

$$\overline{X}=\overline{\sum_{j=1}^{N}x_j}=t\sum_{i=1}^{s}\Gamma_ia_i. \tag{16}$$

$\overline{X^2}$ is again given by (9) but the second term $\sum\sum\overline{x_jx_k}$ is no longer zero
when atoms experience a drift, i.e. when they have a tendency to jump
in one direction in preference to the opposite along the concentration
gradient. It can be shown quite generally that, if $N$ is large,

$$\overline{X^2}=N\,\overline{x_j^2}+(\overline{X})^2. \tag{17}$$


Consider as a simple example a one dimensional crystal with lattice
site spacing $a$. Let $p_+$ and $p_-$ be the relative probabilities that when a
jump is made it is by $+a$ or $-a$ respectively. Then $\overline{X}=(p_+-p_-)Na$.


There are $N(N-1)\approx N^2$ terms in the sum $\sum\sum x_jx_k$ and each is of absolute
value $a^2$. The probability that a term is positive is $p_+^2+p_-^2$ and that it
is negative is $2p_+p_-$. The summation term is therefore

$$\sum\sum\overline{x_jx_k}=N^2(p_+^2+p_-^2-2p_+p_-)a^2=(\overline{X})^2$$

and hence equation (17).
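The one dimensional example can be checked exactly, since the number of $+a$ jumps in $N$ trials follows a binomial distribution. The sketch below is an illustration only: the values of $N$ and $p_+$ and the function name are assumed, and the comparison is with the right-hand side of eqn. (17).

```python
from math import comb

def walk_moments(n_jumps, p_plus, a=1.0):
    """Exact mean and mean square displacement of a 1-D walk of n_jumps steps of +a or -a."""
    p_minus = 1.0 - p_plus
    mean = 0.0
    mean_sq = 0.0
    for k in range(n_jumps + 1):             # k = number of +a jumps
        prob = comb(n_jumps, k) * p_plus ** k * p_minus ** (n_jumps - k)
        x = (2 * k - n_jumps) * a            # nett displacement for this k
        mean += prob * x
        mean_sq += prob * x * x
    return mean, mean_sq

n, p = 400, 0.51                             # assumed illustrative values
mean, mean_sq = walk_moments(n, p)
approx = n * 1.0 ** 2 + mean ** 2            # N a^2 + (mean X)^2, as in eqn (17)
print(mean, mean_sq, approx)
```

The exact mean square displacement is $Na^2+N(N-1)(p_+-p_-)^2a^2$, which differs from the large-$N$ form only through the replacement of $N(N-1)$ by $N^2$.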
However, in the example we discuss later $(\overline{X})^2$ is much less than
$N\,\overline{x_j^2}$ (~4.5%). We shall assume this is generally true and therefore
ignore it, leaving eqn. (10) still valid for diffusion with drift. (The error
involved is smaller than that arising from ignoring correlation between
the directions of successive jumps, which is ~20%.) Since it is proportional
(v.i.) to $(\partial c/\partial x)^2$, a value of $(\overline{X})^2$ comparable with $\overline{X^2}$ would
lead to a term in $J$ proportional to $(\partial c/\partial x)^2$ and so contribute to the
breakdown of Fick’s law in chemical diffusion.

In arriving at the above equations, the averages are taken over a large
number of atom paths each of which involves the same number $N$ of
jumps. Also, the set of values $\Gamma_i$ (or $p_+$ and $p_-$) is assumed to be the same
at each lattice point. Strictly, these conditions do not hold for a random
walk in a chemical gradient. The $\Gamma_i$ change continuously along the
direction of the gradient and $N$, the number of jumps made in a given time,
will vary with the direction of the path. However, for sufficiently small
$t$, so that not too extensive ranges of values of $\Gamma_i$ or of $N$ are involved, we
shall assume as a reasonable approximation for our present purpose that
the true values of $\overline{X}$ and $\overline{X^2}$ do in fact approach those given by (16) and (10).


§ 4. CALCULATION OF $\overline{X}$, $\overline{X^2}$ AND $D$ FOR A SIMPLE MODEL


We shall first discuss a simple model which reveals the essential features 
of random walks in chemical diffusion and then generalize the results in a 
later section. 

The central height of the potential barrier over which an atom jumps in
moving from one site to another we assume to be determined by the average
of the concentrations at these two sites†. This height will vary as we
move along the concentration gradient and if this were the only factor
influencing the random walk we could represent the situation by the
potential barrier diagram of fig. 2. Between two sites, say 1 and 2, the
rate of jumping $\Gamma$ is the same for the direction 1→2 as for 2→1, and by
definition is equal to the rate of jumping in a homogeneous alloy of
composition corresponding to a point midway between 1 and 2, i.e. the
self-diffusion jump rate.

Each atom will clearly experience a drift on account of the changing
barrier height, for an atom at a given site will always move with higher
probability to the right than to the left in fig. 2.


† By “concentration at a site” is meant of course the average concentration
on the lattice plane through that site and normal to the direction of the
concentration gradient.




But increasing the concentration gradient while keeping constant the 
average concentration between two sites is unlikely to leave the jump rate 
between them unaffected and we therefore expect also some dependence 
of the jump rate on the differences in concentration at the two sites. 


Fig. 2 




Fig. 3 


Let us assume that the effect on $\Gamma$ of a difference in concentration arises
because the solid solution is not ideal so that there is a change $\Delta F$ in free
energy when an atom moves from one site at one concentration to another
at a different concentration. As a result of this the effective height of the
potential barrier is increased by $\tfrac{1}{2}\Delta F$ for a jump in the direction which
results in an increase in energy and is reduced by the same amount for
the same jump in the opposite direction. The effect on the potential
barrier diagram is as shown in fig. 3. The vertical height, e.g. $E_{12}$, of
each barrier is the same as in fig. 2 but the minimum on one side of each
barrier is raised by $\tfrac{1}{2}\Delta F$ and on the other side depressed by the same
amount. A jump from 1→2 then produces a nett energy change $\Delta F$.

This effect clearly provides an additional cause for drift since an atom
can now jump even more readily to the right than to the left from any
given site.

These two components of the drift $\overline{X}$, one due to the variation of jump
rate with average composition, the other to its variation with difference
in composition, can of course be oppositely directed. As drawn, fig. 3
provides for both components being in the same direction.

In calculating $\overline{X}$ and $\overline{X^2}$ we shall assume a vacancy mechanism of
diffusion. There is then a possible third component of drift if there is a
gradient of vacancy concentration, for atoms will tend to move towards
regions of high vacancy concentration. However we shall see that if the
concentration of vacancies is everywhere that corresponding to equilibrium
with the local chemical concentration this drift component is zero, as
might be expected.

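Let $\Gamma_{23}$ be the rate of jumping of atoms from site 2 into a vacancy on
site 3 and $N_{v3}$ the fractional vacancy concentration on site 3. Then
for a 2→3 jump $\Gamma_i$ in eqns. (10) and (16) is $N_{v3}\Gamma_{23}$. If $a$ denotes the
interatomic spacing along the concentration gradient then for atoms
starting at site 2

$$\overline{X}_2=ta(N_{v3}\Gamma_{23}-N_{v1}\Gamma_{21})=atN_{v2}(\Gamma_{23}-\Gamma_{21})+2a^2t\,\Gamma_2^*\frac{\partial N_v}{\partial x}, \tag{18}$$

where $\Gamma_2^*$ is the rate of jumping into vacancies in a homogeneous alloy
of composition equal to that on site 2. Similarly, to a first order

$$\overline{X_2^2}=t(a^2N_{v3}\Gamma_{23}+a^2N_{v1}\Gamma_{21})=2a^2tN_{v2}\Gamma_2^*. \tag{19}$$

Let $\mu'_{A2}$ denote the free energy per atom of the diffusing species A at
position 2, less the contribution due to entropy of mixing. $\mu'_{v2}$ denotes
the corresponding quantity for a vacancy at 2. When an atom jumps from
2→3 a vacancy moves simultaneously in the opposite direction so that
the energy change $\Delta F$ is

$$\Delta F_{2\to3}=(\mu'_{A3}-\mu'_{A2})+(\mu'_{v2}-\mu'_{v3})=\left(\frac{\partial\mu'_A}{\partial c}-\frac{\partial\mu'_v}{\partial c}\right)a\frac{\partial c}{\partial x}.$$

If $E$ denotes the central height of a barrier the total effective barrier heights
for jump 2→3 and 2→1 are

$$
\begin{aligned}
Q_{23}&=E_{23}+\frac{a}{2}\left(\frac{\partial\mu'_A}{\partial c}-\frac{\partial\mu'_v}{\partial c}\right)\frac{\partial c}{\partial x},\\
Q_{21}&=E_{21}-\frac{a}{2}\left(\frac{\partial\mu'_A}{\partial c}-\frac{\partial\mu'_v}{\partial c}\right)\frac{\partial c}{\partial x}.
\end{aligned}
\tag{20}
$$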



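The rates of jumping $\Gamma_{23}$ and $\Gamma_{21}$ can be written
$\Gamma_{23}=\nu\exp(-Q_{23}/kT)$ and $\Gamma_{21}=\nu\exp(-Q_{21}/kT)$,
$\nu$ being the vibration frequency of an atom at 2. Since $Q_{23}-Q_{21}$ will be
very small we can write for the difference between the $\Gamma$’s

$$\Gamma_{23}-\Gamma_{21}=-\Gamma_2^*(Q_{23}-Q_{21})/kT. \tag{21}$$

Substituting (20) into (21) and then (21) into (18) and making use of the
relation for the complete chemical potential of the vacancies,
$\mu_v=\mu'_v+kT\ln N_v$, we find for $\overline{X}$

$$\overline{X}=\frac{\overline{X^2}}{2}\left\{\frac{\partial\ln\overline{X^2}}{\partial c}-\frac{1}{kT}\frac{\partial\mu'_A}{\partial c}+\frac{1}{kT}\frac{\partial\mu_v}{\partial c}\right\}\frac{\partial c}{\partial x}, \tag{22}$$

where $\overline{X^2}$ is

$$\overline{X^2}=2a^2tN_v\nu\exp(-E_2/kT) \tag{23}$$

and [eqn. (24): not recoverable from the source].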


The three terms in $\overline{X}$ correspond to the three drift components discussed
above. The first derives from the variation of jump rate with average
composition, the second from its dependence on the difference in composition
and the third from the vacancy gradient. When (22) is put into (14),
the equation for $J$, we find

$$J=-\frac{\overline{X^2}}{2t}\left\{1+\frac{c}{kT}\frac{\partial\mu'_A}{\partial c}-\frac{c}{kT}\frac{\partial\mu_v}{\partial c}\right\}\frac{\partial c}{\partial x}. \tag{25}$$

When the vacancies are in local thermodynamic equilibrium their
gradient is such that $\mu_v$ is everywhere zero and all terms in $\mu_v$ disappear
from the above equations. Also $\overline{X^2}$ can be replaced by $\overline{X^2}^*$, its value in
self-diffusion. We then arrive for the chemical diffusion coefficient at
the usual Darken equation

$$D=\frac{\overline{X^2}^*}{2t}\left\{1+\frac{\partial\ln\gamma_A}{\partial\ln c}\right\}=D^*\left\{1+\frac{\partial\ln\gamma_A}{\partial\ln c}\right\}, \tag{26}$$


where $\gamma_A$ is the activity coefficient, defined by $\mu'_A=kT\ln\gamma_A$.
It is to be noted that the first component of the drift, the term in
$(\partial/\partial c)\overline{X^2}/2t$, has disappeared in the expressions for nett flux $J$ and for $D$.
That $\overline{X^2}$ varies with $c$ produces on the present model both a contribution
to the drift of atoms and a change in the rate of spreading out of atoms
from a given plane as we move along the concentration gradient, i.e. the
last term in the top line of (14). To the degree of approximation employed
in the present calculation these two effects cancel out in $J$. This property is
illustrated in fig. 4. On account of drift alone the two curves of fig. 1 are
now each displaced by $\overline{X}$ from $x_1$ and $x_2$ respectively. There is an increase
in $J$ because the difference of areas A−B is now larger than before. But
because of the increase in rate of spreading as we move along the $x$-axis
the curve about $x_2$ should be wider than that about $x_1$ and so is redrawn
as a dashed curve. On account of this the nett flux $J$ is reduced by the
area C. The calculation shows that the area C is exactly equal to that
part of the increase in A−B which is due to the $(\partial/\partial c)(\overline{X^2}/2t)$ component of
drift. (The curves are shown as Gaussian for simplicity but in fact they
will be skewed to the right. This does not affect the point of the argument.)


The term $(D^*\,\partial\ln\gamma_A/\partial\ln c)$ in the Darken equation may then be
interpreted as a drift component superimposed on the otherwise random
motion of the atoms. It is important to appreciate that this is only one
component of the total drift motion experienced by each atom. There
are two components of drift, assuming the vacancies are in equilibrium,
but only one is manifest in a measurement of a diffusion coefficient. Only
when $D^*$ is independent of concentration does $(D^*\,\partial\ln\gamma_A/\partial\ln c)$ represent
the total drift.
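The cancellation described above can be followed term by term in a short calculation. The sketch below is an illustration under stated assumptions: vacancies are taken to be in equilibrium, $\overline{X^2}/2t$ is identified with $D^*$, the drift per unit gradient is taken as $D^*t\,(\partial\ln D^*/\partial c-\partial\ln\gamma/\partial c)$, and the value of $D^*$ is hypothetical. The function names are not from the text.

```python
import math

def drift_per_unit_gradient(d_star, dlnD_dc, dlngamma_dc, t):
    """Mean drift divided by dc/dx, with equilibrium vacancies and mean(X^2)/2 = D* t."""
    return d_star * t * (dlnD_dc - dlngamma_dc)

def chemical_diffusion_coefficient(c, d_star, dlnD_dc, dlngamma_dc):
    """Three-term definition of D: drift term + random term + spreading term."""
    t = 1.0                                   # arbitrary, cancels in every term
    drift_term = -c * drift_per_unit_gradient(d_star, dlnD_dc, dlngamma_dc, t) / t
    random_term = d_star                      # mean(X^2) / 2t
    spreading_term = c * d_star * dlnD_dc     # c * d/dc of mean(X^2) / 2t
    return drift_term + random_term + spreading_term

c = 0.20              # atom fraction, assumed
d_star = 1.0e-9       # hypothetical self-diffusion coefficient
d_chem = chemical_diffusion_coefficient(c, d_star, dlnD_dc=10.5, dlngamma_dc=4.6)
darken = d_star * (1.0 + c * 4.6)             # D* (1 + dln(gamma)/dln c)
print(d_chem, darken)
```

The part of the drift that depends on $\partial\ln D^*/\partial c$ is removed by the spreading term, leaving only the thermodynamic factor, whatever value is chosen for `dlnD_dc`.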

Expressions can be readily derived for $\overline{X}$, $\overline{X^2}$ and $(\partial/\partial c)(\overline{X^2})$ for the
vacancies themselves and these can be shown to combine in eqn. (14)
to give for the nett flux of vacancies in a binary alloy of A and B the usual
equation

$$J_v=(D_A-D_B)\frac{\partial N_A}{\partial x}. \tag{27}$$

In arriving at (27) the vacancies are assumed to be everywhere in local
equilibrium so that $\mu_v=0$. In the full expression for $J_v$ all terms derived
from $\overline{X_v^2}$ and its variation with concentration disappear and $J_v$ in eqn. (27)
is constructed entirely from the remaining parts of the $\overline{X}$ terms. The displacement
of Kirkendall markers can then justifiably be described as
due to a ‘drift of vacancies’.




§ 5. EXPERIMENTAL MEASUREMENT OF DRIFT OF ATOMS 


A recent experiment by Shuttleworth† confirms that the drift suffered
by individual atoms in a chemical concentration gradient is in fact fairly
well represented by eqn. (22).

A thin foil of ~17% zinc α-brass was n-irradiated to produce radioactive copper
and zinc, welded between a pair of discs, one of copper and one of 27%
α-brass, and then annealed to diffuse. The distributions of radioactive
copper and zinc and of total zinc were then determined by counting and
analysing sections sliced from the sample. The results are shown in fig. 5.


Fig. 5

[Figure: distributions of radioactive copper and zinc, their mean displacements, and the differential of the chemical concentration, plotted against penetration in cm; the upper scale gives the final Zn concentration.]


The dotted line represents the distribution of radioactive zinc and the
dashed line that of radioactive copper. The first moments of these
distributions about the position of the marker interface give $\overline{\overline{X}}_{Cu}$ and
$\overline{\overline{X}}_{Zn}$, the average displacements in the lattice of radioactive copper and
zinc atoms from their original positions (the double bar denotes an average
of $\overline{X}$ over the continuously changing concentration gradient necessarily
experienced by atoms in an experiment of this type). Thus e.g.

$$\overline{\overline{X}}_{Cu}=\frac{\int c_{Cu}(x)\,x\,dx}{\int c_{Cu}(x)\,dx}. \tag{28}$$

The vertical dotted line in fig. 5 is drawn at a distance $\overline{\overline{X}}_{Zn}$ and the
dashed line at a distance $\overline{\overline{X}}_{Cu}$ from the markers.


† Shuttleworth (to be published). I am indebted to Dr. Shuttleworth for
his kind permission to make use of his results prior to publication.




We note that the drift displacements are roughly of equal magnitude
and both in the same direction, down the copper concentration gradient.
This result is incompatible with any interpretation based on Darken’s
equation alone. Darken’s equation reveals a drift flux of amount

$$-D^*\frac{\partial\ln\gamma}{\partial\ln c}\frac{\partial c}{\partial x}$$

and therefore a drift displacement of

$$\overline{X}=-D^*t\,\frac{\partial\ln\gamma}{\partial c}\frac{\partial c}{\partial x} \tag{29}$$

in time $t$. $D^*\,\partial\ln\gamma/\partial c$ for zinc is about ten times larger than it is for
copper but in both cases is positive. Therefore, since $\partial c_{Cu}=-\partial c_{Zn}$, we
should expect the drift for copper to be down the copper concentration
gradient and that for zinc to be ten times greater and in the opposite
direction, down the zinc concentration gradient. This being contrary
to experiment confirms that that component of drift evident in Darken’s
equation cannot be the only one. We turn then to eqn. (22).

The last term of (22) we shall ignore for, if $N_{v0}$ is the equilibrium vacancy
concentration,

$$\frac{1}{kT}\frac{\partial\mu_v}{\partial c}=\frac{\partial\ln(N_v/N_{v0})}{\partial c}\approx\frac{N_v-N_{v0}}{0.3\,N_{v0}}$$

and this last quantity must be of order unity to compare even with the
smaller of the two other terms of (22) (see the table). This requires that
the vacancy concentration be ~30% above equilibrium, which is much
greater than estimates (~1% or less) of the excess vacancy concentration
in couples of this type (Balluffi 1954, Barnes and Mazey 1958).


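Let us now express $\overline{\overline{X}}_{Zn}$ and $\overline{\overline{X}}_{Cu}$ as

$$\overline{\overline{X}}_{Zn}=\overline{D_{Zn}^*t}\left\{\overline{\frac{\partial\ln D_{Zn}^*}{\partial c_{Zn}}}-\overline{\frac{\partial\ln\gamma_{Zn}}{\partial c_{Zn}}}\right\}\overline{\frac{\partial c_{Zn}}{\partial x}}, \tag{30a}$$

$$\overline{\overline{X}}_{Cu}=\overline{D_{Cu}^*t}\left\{\overline{\frac{\partial\ln D_{Cu}^*}{\partial c_{Cu}}}-\overline{\frac{\partial\ln\gamma_{Cu}}{\partial c_{Cu}}}\right\}\overline{\frac{\partial c_{Cu}}{\partial x}}. \tag{30b}$$

The first three terms on the right-hand side of each equation are averages
over the whole range of concentration and $\partial c/\partial x$ is to be averaged over the
concentration range and over the time. The precise determination of
these average values is a difficult problem in concentration dependent
diffusion theory so that we shall be content with approximate estimates.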


For $\overline{D^*t}$ we take simply one half of the second moments of the copper
and zinc distributions in fig. 5, viz. $\tfrac{1}{2}\overline{X_{Cu}^2}$ and $\tfrac{1}{2}\overline{X_{Zn}^2}$, evaluated about the
marker interface.

Diffusion coefficients in the α-brass system generally vary approximately
logarithmically with composition so that $\partial\ln D^*/\partial c$ is fairly constant
over the diffusion zone. Its values for copper and zinc have been estimated
from the data of Inman et al. (1954) and Hino et al. (1957).




$\partial\ln\gamma/\partial c$ was estimated at 10, 20 and 25 atm% zinc from the data of
Herbenar, Siebert and Duffendack as smoothed by Lumsden (1952).

The average value of $\partial c/\partial x$ across the diffusion zone at the end of the
experiment was estimated from the full curve in fig. 5 to be ~75 atm%/cm.
Assuming $c$ to be a function of $x$ and $t$ in the combination $x/t^{1/2}$, the time
average of $\partial c/\partial x$ at any given concentration is twice the final value of
$\partial c/\partial x$. The space and time average of $\partial c/\partial x$ we therefore take as
150 atm%/cm or 1.5 atm fraction/cm.
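The factor of two follows because, for a profile depending only on $x/t^{1/2}$, the gradient at a fixed concentration varies as $t^{-1/2}$. A small numerical sketch of that time average is given below; the midpoint rule, the number of steps and the function name are illustrative choices, and the final gradient of 75 atm%/cm is the estimate quoted above.

```python
def time_averaged_gradient(final_gradient, n=100000):
    """Average over 0 < t <= T of a gradient that varies as t**-0.5.

    final_gradient is the value at t = T; s = t/T runs over (0, 1].
    """
    total = 0.0
    for k in range(n):
        s = (k + 0.5) / n                    # midpoint of the k-th time slice
        total += final_gradient * s ** -0.5 / n
    return total

average = time_averaged_gradient(75.0)       # atm%/cm
print(average, average / 100.0)              # close to 150 atm%/cm, i.e. 1.5 per cm
```

The midpoint rule converges slowly near $t=0$, where the gradient diverges, so the numerical value falls slightly below the exact result $2\times75=150$.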

The table shows the values of these several quantities, together with the
experimental and calculated values of $\overline{\overline{X}}_{Zn}$ and $\overline{\overline{X}}_{Cu}$, the latter being
calculated for the 20 atm% value of $\partial\ln\gamma/\partial c$. The agreement between
the theoretical and experimental values shown in the last two columns is
as good as can be expected in view of the approximations made, perhaps
surprisingly so†. In particular the theoretical drifts are both in the same
direction and approximately equal. The experimental result that
$|\overline{\overline{X}}_{Cu}|>|\overline{\overline{X}}_{Zn}|$ is not reproduced theoretically but since the difference is in
any case small some refinement of the theory would be necessary to achieve
this. Taking into consideration the possible third component of drift,
due to a non-equilibrium vacancy distribution, does not help here.


          ∂c/∂x              ∂ ln γ/∂c              ∂ ln D*/∂c   ½X̄² = D*t          X̄
      atm fract/cm    10%     20%      25% Zn                                Theor.    Exper.
Zn       −1.5         3.6     4.6      5.6           10.5        0.0018     −0.016    −0.014
Cu       +1.5         0.40  [illegible]  1.87       −10.1        0.0008     −0.014    −0.017


This drift component would be directed towards the zinc-rich side of the
sample (−ve x) for both zinc and copper and being proportional to D
its effect would be only to enhance the difference between the theoretical
values of $\bar X_{\rm Cu}$ and $\bar X_{\rm Zn}$, without changing their order of size. This
vindicates the neglect of this term.

It appears then that that component of the drift which is due to the
variation of D* with composition is for both zinc and copper greater than
the other component due to the solution being non-ideal, and therefore
dominates the drift behaviour in this system.
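
The relative size of the two components can be illustrated with the tabulated zinc values. The sketch below assumes the drift has the form X = D*t (∂ ln D*/∂c − ∂ ln γ/∂c) ∂c/∂x; this form is inferred here because it reproduces the tabulated theoretical zinc drift, and the function name is hypothetical.

```python
def drift_components(dstar_t, dlnD_dc, dlngamma_dc, dc_dx):
    """Split the drift into its two components (assumed form, see lead-in).

    dstar_t      : D* t, taken as half the second moment (cm^2)
    dlnD_dc      : d ln D* / dc
    dlngamma_dc  : d ln gamma / dc
    dc_dx        : averaged concentration gradient (atm fraction / cm)
    """
    composition_part = dstar_t * dlnD_dc * dc_dx       # from variation of D* with c
    nonideal_part = -dstar_t * dlngamma_dc * dc_dx     # from non-ideality of the solution
    return composition_part, nonideal_part, composition_part + nonideal_part


# Zinc row of the table, with the 20 atm% value of d ln(gamma)/dc.
comp, nonideal, total = drift_components(0.0018, 10.5, 4.6, -1.5)
print(f"D* component        : {comp:+.4f} cm")
print(f"non-ideal component : {nonideal:+.4f} cm")
print(f"total drift         : {total:+.4f} cm")
```

With these inputs the composition-dependence term is roughly twice the magnitude of the non-ideality term and of opposite sign, so it fixes the direction of the net drift.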

The drift displacements can also be compared with the Kirkendall
marker shift which was measured in the same experiment. The marker
shift is given by

$$X_m = 2t\,(D^*_{\rm Zn} - D^*_{\rm Cu})\left(1 + \frac{\partial \ln\gamma}{\partial \ln c}\right)\left(\frac{\partial c}{\partial x}\right)_m,$$


† Dr. Manning (University of Illinois) has more recently measured the drifts
of silver and of cadmium in a number of α-AgCd alloys and his results provide
further confirmation of the present theory (private communication).


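where the D's and $(\partial c/\partial x)_m$ are to be evaluated at the marker composition.
$(\partial c/\partial x)_m \approx 120$ atm%/cm and $D^*_{\rm Zn} \approx 3D^*_{\rm Cu}$ so that if we assume
$D^*_{\rm Zn}t = \tfrac{1}{2}\overline{X^2_{\rm Zn}}$

$$\frac{X_m}{\bar X_{\rm Zn}} = \frac{\tfrac{4}{3}\left(1+\dfrac{\partial\ln\gamma}{\partial\ln c}\right)\left(\dfrac{\partial c}{\partial x}\right)_m}{\left(\dfrac{\partial\ln D^*_{\rm Zn}}{\partial c_{\rm Zn}}-\dfrac{\partial\ln\gamma_{\rm Zn}}{\partial c_{\rm Zn}}\right)\dfrac{\partial c}{\partial x}} = 0.35.$$

Similarly

$$\frac{X_m}{\bar X_{\rm Cu}} = 0.58.$$

These estimates are in reasonable agreement with the experimental value
of about one half.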
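
The zinc ratio can be reproduced as a rough arithmetic check. The sketch below assumes a marker shift of the Darken form, 2t(D*_Zn − D*_Cu)(1 + ∂ ln γ/∂ ln c)(∂c/∂x)_m, a drift of the form D*t(∂ ln D*/∂c − ∂ ln γ/∂c)∂c/∂x, and a marker composition of 20 atm% zinc; the marker composition and the function name are assumptions made for illustration only.

```python
def marker_to_zinc_drift_ratio(dlnD_dc, dlngamma_dc, c_marker,
                               grad_marker, grad_mean, d_ratio=3.0):
    """Ratio of marker shift to zinc drift; D*_Zn t cancels out.

    d_ratio is D*_Zn / D*_Cu, taken as 3 following the text.
    c_marker is an assumed marker composition (atom fraction of zinc).
    """
    thermo = 1.0 + c_marker * dlngamma_dc             # 1 + d ln(gamma) / d ln(c)
    numerator = 2.0 * (1.0 - 1.0 / d_ratio) * thermo * grad_marker
    denominator = (dlnD_dc - dlngamma_dc) * grad_mean
    return numerator / denominator


# Gradients in atm fraction / cm: 1.2 at the marker, 1.5 averaged (both towards -x).
ratio = marker_to_zinc_drift_ratio(10.5, 4.6, 0.20, -1.2, -1.5)
print(f"X_m / X_Zn = {ratio:.2f}")
```

Under these assumptions the ratio comes out near 0.35, in line with the figure quoted above.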

It will be clear that the simplest type of drift experiment to interpret 
theoretically would be one in which the drift is allowed to take place in a
steady concentration gradient, i.e. under conditions of steady state
diffusion. One might establish a steady rate of flow of solute into, through
and out of a solvent plate, then cut the plate in two normal to the direction
of diffusion, deposit a thin layer of radioactive solute on one of the cut 
surfaces, reweld the two surfaces together, with markers, and continue 
the steady state diffusion anneal. Such an experiment could be carried 
out either for a substitutional or an interstitial alloy system. 


§ 6. GENERAL DISCUSSION 


We have seen that chemical diffusion differs from self-diffusion in that, 
superimposed upon the otherwise random spreading out of atoms from 
any given region there is a tendency for atoms to drift in one direction 
or the other, i.e. the expression for J contains both $\bar X$ and $\overline{X^2}$ terms.
There are two (or three where vacancies are not in equilibrium) components
to $\bar X$ but only one of these is manifest in a diffusion coefficient
measurement. This latter result is not peculiar to the model we have chosen
but is true at least for any model in which the rate of jumping between a 
pair of sites is a function only of the composition at those two sites. 

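Denote the composition at a site from which a jump is made by α and
that at the site to which it is made by ω. Then we suppose the rate of
jumping to be a function Γ(α, ω). Any such function can always be
expressed as a function of the sum and of the difference of the arguments

$$\Gamma(\alpha,\omega) = \Gamma'(\alpha+\omega,\ \omega-\alpha)$$

and

$$\frac{\partial\Gamma}{\partial\alpha} = \frac{\partial\Gamma'}{\partial(\alpha+\omega)} - \frac{\partial\Gamma'}{\partial(\omega-\alpha)}, \qquad \frac{\partial\Gamma}{\partial\omega} = \frac{\partial\Gamma'}{\partial(\alpha+\omega)} + \frac{\partial\Gamma'}{\partial(\omega-\alpha)}. \tag{32}$$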
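The average displacement from a site on the plane at x = 0 is then from (16)

$$\bar X = t\sum_i x_i\,\Gamma_i\bigl(c(0), c(x_i)\bigr) = t\sum_i x_i\left\{\Gamma_i\bigl(c(0), c(0)\bigr) + x_i\frac{\partial\Gamma_i}{\partial\omega}\frac{\partial c}{\partial x}\right\} = t\sum_i x_i^2\,\frac{\partial\Gamma_i}{\partial\omega}\frac{\partial c}{\partial x}, \tag{33}$$

for Γ(c(0), c(0)) is the rate of jumping in a homogeneous alloy of composition
c(0) so that $\sum_i x_i\Gamma_i(c(0), c(0)) = 0$. Using (32), (33) can be written

$$\bar X = t\sum_i x_i^2\left\{\frac{\partial\Gamma_i'}{\partial(\alpha+\omega)} + \frac{\partial\Gamma_i'}{\partial(\omega-\alpha)}\right\}\frac{\partial c}{\partial x}. \tag{34}$$

$\bar X$ is the sum of two terms, one depending on the variation of jump
rate with average composition, the other on its dependence on difference
in composition.

Similarly

$$\overline{X^2} = t\sum_i x_i^2\,\Gamma_i\bigl(c(0), c(x_i)\bigr) = t\sum_i x_i^2\,\Gamma_i\bigl(c(0), c(0)\bigr) = \overline{X^{*2}} \quad\text{for}\quad \sum_i x_i^3\,\frac{\partial\Gamma_i}{\partial\omega} = 0. \tag{35}$$

Then

$$\frac{\partial\overline{X^2}}{\partial c} = t\sum_i x_i^2\left\{\frac{\partial\Gamma_i}{\partial\alpha} + \frac{\partial\Gamma_i}{\partial\omega}\right\}$$

or, using (32),

$$\frac{\partial\overline{X^2}}{\partial c} = 2t\sum_i x_i^2\,\frac{\partial\Gamma_i'}{\partial(\alpha+\omega)}. \tag{36}$$

Inserting (34), (35), and (36) into (14)

$$D = D^* - c(0)\sum_i x_i^2\,\frac{\partial\Gamma_i'}{\partial(\omega-\alpha)}, \tag{37}$$

from which the first component of the drift in (34) is absent.
In many crystals the $\Gamma_i$ are all equal so that $\partial\Gamma_i'/\partial(\omega-\alpha)$ can be taken
outside the summation to obtain

$$D = D^*\left(1 - 2c\,\frac{\partial\ln\Gamma'}{\partial(\omega-\alpha)}\right) \tag{38}$$

which is a generalized form of Darken's equation.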
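
The cancellation of the first drift component can be checked numerically on a toy model. The sketch below is only illustrative: it assumes one-dimensional jumps of length ±A, a hypothetical exponential jump rate with constant logarithmic derivatives P (with respect to α + ω) and Q (with respect to ω − α), and a flux of the form J = c X̄/t − (X̄²/2t) ∂c/∂x − (c/2t)(∂X̄²/∂c) ∂c/∂x, which is taken here as the content of eqn. (14).

```python
import math

P, Q = 0.3, 0.2      # hypothetical d ln(Gamma')/d(alpha+omega) and d ln(Gamma')/d(omega-alpha)
A, T = 1.0, 1.0      # jump length and elapsed time, arbitrary units


def gamma(alpha, omega):
    """Hypothetical jump rate Gamma(alpha, omega) = Gamma'(alpha + omega, omega - alpha)."""
    return math.exp(P * (alpha + omega) + Q * (omega - alpha))


def mean_displacement(c0, grad):
    """Mean displacement: t * sum of x_i * Gamma_i(c(0), c(x_i)) over jumps +A and -A."""
    return T * sum(x * gamma(c0, c0 + x * grad) for x in (+A, -A))


def mean_square(c0):
    """Mean square displacement in a homogeneous alloy of composition c0."""
    return T * sum(x * x * gamma(c0, c0) for x in (+A, -A))


def chemical_D(c0, grad=1e-6, h=1e-5):
    """Chemical diffusion coefficient, -J / grad, from the assumed flux expression."""
    dX2_dc = (mean_square(c0 + h) - mean_square(c0 - h)) / (2 * h)
    flux = (c0 * mean_displacement(c0, grad) / T
            - mean_square(c0) / (2 * T) * grad
            - c0 / (2 * T) * dX2_dc * grad)
    return -flux / grad


def darken_D(c0):
    """Generalized Darken form: D = D* (1 - 2 c d ln(Gamma')/d(omega - alpha))."""
    d_star = mean_square(c0) / (2 * T)
    return d_star * (1 - 2 * c0 * Q)


c0 = 0.3
print(chemical_D(c0), darken_D(c0))
```

The two printed values agree closely whatever value is chosen for P, which is the numerical counterpart of the statement that only the difference-dependent component of the drift survives in the diffusion coefficient.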

An assumption implicit in the treatment so far is that the random walk
behaviour of an atom of a given species in an alloy is determined entirely
by the concentration of that species alone, for the probability functions
f(X, t, c(x)) and the jump rates Γ(α, ω) contain reference only to the
concentration of the diffusing species. This is clearly inadequate in
systems of three or more diffusing components in substitutional solid
solution (or of two diffusing components in interstitial solution). The
probability that an atom of a particular species k migrates a distance X
in time t will in general depend on the concentration distribution of each
species present and for a given distribution can be written $f_k(X, t, c_1(x) \ldots c_n(x))$
where $c_j(x)$ denotes the concentration of j at the starting point x and
n is the number of components whose concentration must be specified
fully to describe the composition. Expanding this function about its
value for atoms starting their random walk at the reference plane $x_0$, to
give an equation analogous with (13), substituting the expansion into the
basic eqn. (6), and integrating, we obtain for the rate of flow of the kth


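component

$$J_k = -\frac{\overline{X_k^2}}{2t}\frac{\partial c_k}{\partial x} + c_k\left\{\frac{\bar X_k}{t} - \sum_{j=1}^{n}\frac{\partial}{\partial c_j}\left(\frac{\overline{X_k^2}}{2t}\right)\frac{\partial c_j}{\partial x}\right\} \tag{39}$$

which is the generalized form of eqn. (14).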


If now it is further assumed that the rate of jumping of species k between
two sites is a function of the concentration on these two sites of the
n species, Γ(α, ω) is replaced by

$$\Gamma_k(\alpha_1, \alpha_2 \ldots \alpha_n;\ \omega_1, \omega_2 \ldots \omega_n) = \Gamma_k'(\alpha_1+\omega_1, \ldots\, \alpha_n+\omega_n;\ \omega_1-\alpha_1, \ldots\, \omega_n-\alpha_n)$$

and eqn. (32) by a similar set, n in number for each k.

$\bar X_k$ and $\overline{X_k^2}$ can then be evaluated as before to lead eventually to the
familiar set of equations for describing multicomponent diffusion

$$J_k = -\sum_j D_{kj}\frac{\partial c_j}{\partial x} \tag{40}$$


where, if the $\Gamma_{k,i}$ for any given k are all equal,

$$D_{kk} = D_k^*\left(1 - 2c_k\frac{\partial\ln\Gamma_k'}{\partial(\omega_k-\alpha_k)}\right) \tag{41}$$

and

$$D_{kj} = -2c_k D_k^*\,\frac{\partial\ln\Gamma_k'}{\partial(\omega_j-\alpha_j)}. \tag{42}$$

The diagonal terms $D_{kk}$ are identical in form with (37) while the cross
terms $D_{kj}$ are drift terms determined by the dependence of the jump rate
of species k on the difference in composition at the two sites of component j.
A measurement of the drift of species k would reveal a second component

$$2D_k^*\,t\sum_j\frac{\partial c_j}{\partial x}\,\frac{\partial\ln\Gamma_k'}{\partial(\omega_j+\alpha_j)}$$

but this again does not contribute to the nett rate of flow.

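The dependence of $\Gamma_k'$ on the difference in composition at two sites we
can again associate with the change in free energy when an atom jumps.
$\bar X$ then becomes for the kth component,

$$\bar X_k = 2D_k^*\,t\sum_j\left\{\frac{\partial\ln\Gamma_k'}{\partial(\omega_j+\alpha_j)} - \frac{1}{2kT}\frac{\partial\mu_k'}{\partial c_j}\right\}\frac{\partial c_j}{\partial x} \tag{43}$$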


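in place of (22). $\mu_k'$ is the chemical potential of k, less the contribution
from entropy of mixing. $J_k$ is therefore

$$J_k = -\frac{\overline{X_k^2}}{2t}\left\{\frac{\partial c_k}{\partial x} + \frac{c_k}{kT}\sum_{j=1}^{n}\frac{\partial\mu_k'}{\partial c_j}\frac{\partial c_j}{\partial x}\right\}. \tag{44}$$

Since $\mu_k' = \mu_k - kT\ln c_k$, $\mu_k$ being the complete chemical potential of k,
the last equation can be written like (40) with

$$D_{kj} = \frac{\overline{X_k^2}}{2t}\,\frac{c_k}{kT}\,\frac{\partial\mu_k}{\partial c_j} \tag{45}$$

or simply as

$$J_k = -\frac{\overline{X_k^2}}{2t}\,\frac{c_k}{kT}\,\frac{\partial\mu_k}{\partial x}. \tag{46}$$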


An alternative description of multicomponent diffusion, obtained from 
consideration of the thermodynamics of irreversible processes, leads to the 
equations (see e.g. de Groot 1951)

$$J_k = -\sum_j L_{kj}\frac{\partial\mu_j}{\partial x}. \tag{47}$$

Comparing (47) with (46) we see that $L_{kk} = (\overline{X_k^2}/2t)(c_k/kT)$ but that the
cross terms $L_{kj}$, relating the rate of flow of a species k to the chemical
potential gradient of a species j, are all zero. The model we have used
is not then sufficiently detailed to reveal any effects represented by the
cross terms in (47).

According to Bardeen and Herring (1951) cross-terms appear only 
when ‘correlation’ effects are taken into account; these have so far been 
neglected in our discussion. From the nature of vacancy diffusion 
(although not peculiar to it) the direction of one jump of an atom is not 
independent of the direction of its previous or earlier jumps, for the 
vacancy which effects one jump is available to effect the second jump 
in the opposite rather than in a random direction. Successive jumps
are said to be 'correlated' with the result that the summation term
$\sum\sum x_i x_j$ in eqn. (9) for $\overline{X^2}$ is not zero, as we assumed it to be, even for
self-diffusion. Calculations show that for self-diffusion the true value
of $\overline{X^2}$ is the right-hand side of (10) multiplied by a geometrical factor
of roughly 0.8, the 'correlation factor'.

In chemical diffusion $\sum\sum x_i x_j$ is in any event not zero but correlation
will contribute further to it and in amount of the same order of magnitude
as in self-diffusion. The important point is however that taking correlation
into account will only modify the value of $\overline{X_k^2}$ that is to be inserted into
the general eqn. (39) for $J_k$. Proceeding then from (39) to (46) the
correlation factor will be carried through to appear only as a modification
of the diagonal terms $L_{kk}$. It can in no way appear wholly in cross-terms.
However, the detailed calculation of $\overline{X_k^2}$ for chemical diffusion would no
doubt reveal in $\sum\sum x_i x_j$ terms proportional to the concentration or
chemical potential gradients of species other than k. These terms could
be separated out from $L_{kk}$ to yield cross-terms but they would be of
second order because they would carry with them the coefficient $\partial\mu_k/\partial x$:
this would furthermore be an unnecessary complication of the equations.
If cross terms are extracted in this way then it appears, at least with the
present model, that they only represent the effect on correlation of
introducing the chemical concentration gradient: the diagonal terms always
contain the main effect, determined by the relative concentration and
jump rates of the several species present, and in the simplest representation,
without cross terms, they contain the whole effect.

This conclusion is contrary to that of Bardeen and Herring who
considered that the whole effect of correlation was contained in the cross terms.

There may well of course be other contributions to the $L_{kj}$ which only
a more detailed model would reveal.




ACKNOWLEDGMENTS 


I am indebted to Dr. H. M. Finniston for his interest in this work and to
Drs. A. J. Mortlock, G. V. Kidson and W. M. Lomer for many helpful 
comments during the preparation of the manuscript. 


REFERENCES 


Balluffi, R., 1954, Acta Met., 2, 194.

Bardeen, J., 1949, Phys. Rev., 76, 1408.

Bardeen, J., and Herring, C., 1951, Atom Movements (American Society
for Metals).

Barnes, R. S., and Mazey, D. J., 1958, Acta Met., 6, 1.

Chandrasekhar, S., 1943, Rev. mod. Phys., 15, 1.

Compaan, K., and Haven, Y., 1956, Trans. Faraday Soc., 52, 786.

Darken, L. S., 1948, Trans. Amer. Inst. min. (metall.) Engrs, 175, 184.

Einstein, A., 1905, Ann. Phys., 17, 549.

de Groot, S. R., 1951, Thermodynamics of Irreversible Processes (Amsterdam :
North Holland Publishing Co.).

Hino, J., Tomizuka, C., and Wert, C., 1957, Acta Met., 5, 41.

Inman, M. R. C., Johnston, D., Mercer, W. L., and Shuttleworth, R.,
1954, Oxford Radioisotopes Conference, vol. II, p. 85.

Kramers, H. A., 1940, Physica, 7, 284.

Le Claire, A. D., 1953, Progress in Metal Physics, vol. IV (London : Pergamon
Press), p. 245.

Le Claire, A. D., and Lidiard, A. B., 1956, Phil. Mag., 1, 518.

Lumsden, J., 1952, Thermodynamics of Alloys (London : Institute of Metals).

Seitz, F., 1955, J. phys. Soc. Japan, 10, 679.




The Diffusion Constant, Mobility and Lifetime of Minority Carriers
in Germanium containing Parallel Arrays of Dislocations†


By J. B. Arthur, A. F. Gibson, J. W. Granville and E. G. S. Paige
Royal Radar Establishment, Malvern 


[Received April 28, 1958] 


ABSTRACT 


The presence of a high density of parallel edge dislocations in N-type 
germanium is found to significantly enhance the diffusion of holes in a 
direction parallel to the dislocations. The apparent diffusion constant is 
therefore anisotropic. In P-type germanium on the other hand the diffusion 
constant is isotropic and the carrier lifetime anisotropic. 

At high electric fields the drift mobility of holes in N-type germanium is 
found to be anisotropic with respect to the dislocation array, no comparable 
effect occurring in P-type material. 

These results can be explained by a model which assumes that dislocations 
introduce an additional acceptor level approximately intermediate in energy 
between the conduction and valence energy bands. 


§ 1. INTRODUCTION 


In a recent paper Bell and Hogarth (1957) have shown that the diffusion 
length of minority carriers in germanium and silicon, measured by the 
travelling light spot technique, may be anisotropic if the crystal contains 
a high density of edge dislocations parallel to one another. The diffusion 
length measured parallel to the dislocation array was typically a factor 
of 2 or 3 greater than that measured perpendicular to the array. In 
addition Bell and Hogarth observed that the decay of photoconductivity 
in filaments or rods cut parallel and across the dislocations was significantly 
different, the latter having the lower apparent lifetime. The authors
demonstrated, however, that the latter was a surface effect.

In interpreting the anisotropy of diffusion length, Bell and Hogarth
assumed that the diffusion constant, D, was isotropic and equal to the
normal value given by the Einstein relation :

$$D = \frac{kT}{q}\,\mu \tag{1}$$

where μ is the drift mobility of the minority carriers, k Boltzmann's
constant, T the absolute temperature and q the electronic charge. All
the anisotropy observed, therefore, resided in the deduced carrier
lifetime, τ.

† Communicated by the Authors.

It is well known that dislocations in germanium or silicon crystals act
as efficient recombination centres (Wertheim and Pearson 1957). In the
travelling light spot experiment, however, the total path length of a
carrier is of the order of 10⁴ times greater than the diffusion length which,
in the crystals used, is itself an order of magnitude greater than the 
average separation between dislocations. Hence the probability of a 
carrier encountering a dislocation is virtually independent of its point of 
origin. Bell and Hogarth pointed out that significant anisotropy could 
be obtained on the above model if two additional assumptions were 
made, namely : 


(a) The dislocations were largely polygonized into ‘ walls’; and 


(b) The dislocations were surrounded by a potential barrier which
tended to exclude minority carriers from the high recombination region.


Assumption (a) essentially reduces the model to a two dimensional one 
and ensures that carriers diffusing at right angles to the dislocation array 
must cross at least some dislocations. Assumption (b) restricts the 
random walk of carriers diffusing parallel to the dislocations and ensures 
that few, if any, of them cross a dislocation. On this model any anisotropy
of from 1 to ∞ may be obtained, depending on the degree of polygonization
and the effective barrier height.

The object of the work to be described in the present paper was, 
primarily, to check the validity of the assumption that the diffusion 
constant, D, is isotropic and equal to the normal value in heavily
dislocated material. Two approaches to the measurement of D have been
made, namely drift mobility measurements and the simultaneous
measurement of phase and amplitude in the travelling light spot experiment.
We shall show that in N-type germanium D is not isotropic and the
results allow a more complete model to be built up. In P-type germanium,
on the other hand, D is isotropic and all the anisotropy appears to reside
in τ. In addition measurements of drift mobility at high electric fields
will be described, the results obtained giving valuable quantitative 
support for the model proposed. 


§ 2. THE MEASUREMENT OF HOLE MOBILITY AND DIFFUSION CONSTANT
BY THE DRIFT METHOD

2.1. Drift Mobility : Experimental

A conventional drift mobility equipment was used (Haynes and 
Shockley 1951), the pulsed field, pulsed emitter arrangement being 
adopted. Filaments were cut from N-type germanium crystals with 
‘ grown in’ edge dislocations (Bell and Hogarth 1957), the major filament 
axis lying parallel or normal to the dislocation array. Emitter and 
collector contacts, end connections and voltage probes were attached by 
conventional alloying techniques. 

The relatively low minority carrier lifetime of the dislocated material,
together with the non-uniformity of the dislocation density, markedly
reduced the accuracy and reproducibility that can normally be obtained
in this experiment. In addition it was necessary to use relatively high
resistivity material (~30 ohm cm) to maintain the lifetime at an acceptable
value, which increased errors due to conductivity modulation. To
minimize the latter effect measurements were made at various emitter
currents and the results extrapolated to zero emitter current.

The diffusion constant of the carriers was deduced from measurements
of the width of the carrier arrival pulse at the collector (Lawrance and
Gibson 1952). The above sources of error, particularly conductivity
modulation, are about an order of magnitude more important in the
determination of D than in μ.
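
The way a pulse width yields a diffusion constant can be sketched with the usual textbook idealization of the drift experiment. The code below assumes a Gaussian packet that spreads as σ² = 2Dt while drifting at constant velocity, and neglects recombination, the finite injection pulse and conductivity modulation; it is not the specific procedure of Lawrance and Gibson, and the function names and numerical values are illustrative.

```python
import math


def pulse_fwhm(D, drift_length_cm, transit_time_s):
    """Full width at half maximum (in seconds) of the arrival pulse for an ideal packet."""
    sigma_x = math.sqrt(2.0 * D * transit_time_s)             # spatial spread on arrival
    fwhm_x = 2.0 * math.sqrt(2.0 * math.log(2.0)) * sigma_x   # Gaussian FWHM in cm
    velocity = drift_length_cm / transit_time_s
    return fwhm_x / velocity


def diffusion_constant_from_pulse(drift_length_cm, transit_time_s, fwhm_s):
    """Invert the idealized relation: D = (v * FWHM)^2 / (16 ln2 * t0)."""
    velocity = drift_length_cm / transit_time_s
    return (velocity * fwhm_s) ** 2 / (16.0 * math.log(2.0) * transit_time_s)


# Hypothetical filament: 0.7 cm drift length, 20 microsecond transit time.
width = pulse_fwhm(49.0, 0.7, 20e-6)
print(f"pulse width = {width * 1e6:.2f} microseconds")
print(f"recovered D = {diffusion_constant_from_pulse(0.7, 20e-6, width):.1f} cm^2/sec")
```

Because D depends on the square of the measured width, any extra broadening enters twice over, which is one way of seeing why errors are so much more serious for D than for the mobility.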


2.2. Drift Mobility : Results 
The results obtained may be summarized as follows : 


(1) The drift mobility parallel to and across the dislocation array was
the same in all pairs of filaments and equal to the mobility in normal,
undislocated material of the same resistivity.

(2) In some pairs of filaments the diffusion constant, deduced from the 
width of the hole pulse at the collector, was significantly greater than the 
normal value parallel to the dislocations but approximately equal to the 
normal value across the dislocations. Unfortunately a relatively long 
carrier lifetime is required if an accurate value of D is to be obtained by 
this technique. A long lifetime implies a low dislocation density and 
hence small anisotropy, so an uneasy compromise has to be reached. 
Illustrative of the results obtained from such a compromise are those 
given in table 1 for an N-type germanium crystal (designated PH34), 
which was also examined by Bell and Hogarth (1957). 


Table 1

                      Filament cut                        Filament cut
                      parallel to dislocations            normal to dislocations

Mobility              μ∥ = 1850 ± 100 cm² volt⁻¹ sec⁻¹     μ⊥ = 1950 ± 100 cm² volt⁻¹ sec⁻¹
Diffusion constant    D∥ = 400 ± 100 cm² sec⁻¹             D⊥ = 80 ± 30 cm² sec⁻¹

Crystal PH34. Resistivity ~30 ohm cm.


It may be shown, by general arguments, that any disturbance (e.g.
non-uniformity of filament cross section or resistivity) will increase the
width of the collector pulse and hence the apparent value of D. It seems
likely that the moderately high value of $D_\perp$ is due to non-uniformity or
similar effects. The very large value of $D_\parallel$ cannot be explained so easily,
however, and suggests that the dislocations materially assist the rate of
diffusion of holes. In view, however, of the difficult compromise on
which this experiment is based the results cannot be considered conclusive
without additional evidence from the travelling light spot experiment.
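
For comparison, the normal value of D implied by the Einstein relation (1) can be computed from the measured mobilities. The sketch below assumes a temperature of 300 K, which is not stated in the text; the function name is illustrative.

```python
K_OVER_Q = 8.617333262e-5   # Boltzmann constant divided by electronic charge, volt per kelvin


def einstein_diffusion_constant(mobility_cm2_per_volt_sec, temperature_K=300.0):
    """D = (kT/q) * mu, in cm^2/sec. The default temperature is an assumption."""
    return K_OVER_Q * temperature_K * mobility_cm2_per_volt_sec


for mu in (1850.0, 1950.0):
    D = einstein_diffusion_constant(mu)
    print(f"mu = {mu:.0f} cm^2/volt sec  ->  D = {D:.1f} cm^2/sec")
```

On this assumption the measured mobilities correspond to D of roughly 48 to 50 cm² sec⁻¹, so the value of about 400 cm² sec⁻¹ found parallel to the dislocations lies far outside the quoted experimental uncertainty.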




§ 3. THE MEASUREMENT OF DIFFUSION CONSTANT AND LIFETIME BY
THE TRAVELLING LIGHT SPOT METHOD

3.1. The Travelling Light Spot : Experimental

It has been shown by Avery and Gunn (1955) that if the phase and
amplitude of the signal in a travelling light spot experiment are measured
simultaneously the values of D and τ may be obtained uniquely. The
phase shift, which is a measure of the transit time of carriers from the
light spot to the detector contact, naturally increases with the light
interruption frequency. Alternatively, it is, in principle, sufficient to
measure the amplitude only at 2 or more interruption frequencies.

In the experiments to be described measurements have been made of
amplitude and phase at 4 kc/s and amplitude only at 800 c/s. In this
way three equations are obtained for two unknowns (D and τ) which
allows a check on consistency to be made. With low lifetime dislocated
material phase measurements at the lower frequency are of inadequate
accuracy though they are used, with undislocated crystals, for periodic
checks on the apparatus.

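We have chosen to use the data to deduce two values of D and one
value of τ. The observed quantities are signal amplitude, S, relative
phase angle, θ, and distance from the light spot to the contact, r. Then,
following Avery and Gunn (1955), the relevant equations are as follows :

$$D = \frac{\omega_1}{2\left(\dfrac{\partial\theta}{\partial r}\right)_{\omega_1}\left(\dfrac{\partial}{\partial r}\ln\dfrac{Sr}{r_0}\right)_{\omega_1}} \tag{2}$$

and

$$\frac{\left(\dfrac{\partial\theta}{\partial r}\right)_{\omega_1}}{\left(\dfrac{\partial}{\partial r}\ln\dfrac{Sr}{r_0}\right)_{\omega_2}} = \left[\frac{(1+\omega_1^2\tau^2)^{1/2}-1}{(1+\omega_2^2\tau^2)^{1/2}+1}\right]^{1/2} \tag{3}$$

where $r_0$ is the radius of the light spot and $\omega_1 = 2\pi(4000)$ and $\omega_2 = 2\pi(800)$
respectively. Independent values of D and τ may be deduced from
eqns. (2) and (3). When τ is known a second value of D may be obtained
from the following equation :

$$D = \frac{(1+\omega_1^2\tau^2)^{1/2}-1}{2\tau\left(\partial\theta/\partial r\right)^2_{\omega_1}}. \tag{4}$$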
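
How three slope measurements give two values of D and one of τ can be sketched as a round trip. The code below assumes the standard diffusion and recombination result that the signal from a point source varies as (1/r) exp(−kr) with k² = (1 + iωτ)/(Dτ), so that the amplitude and phase slopes are the real and imaginary parts of k; the function names, the bisection limits and the trial values of D and τ are illustrative assumptions.

```python
import cmath
import math

OMEGA_1 = 2 * math.pi * 4000.0   # rad/sec, 4 kc/s
OMEGA_2 = 2 * math.pi * 800.0    # rad/sec, 800 c/s


def slopes(D, tau, omega):
    """Forward model: magnitudes of (amplitude slope, phase slope) = (Re k, Im k)."""
    k = cmath.sqrt((1 + 1j * omega * tau) / (D * tau))
    return k.real, k.imag


def D_from_amplitude_and_phase(amp_slope, phase_slope, omega=OMEGA_1):
    """D from amplitude and phase at one frequency, since Re(k) * Im(k) = omega / (2 D)."""
    return omega / (2.0 * amp_slope * phase_slope)


def tau_from_slope_ratio(phase_slope_1, amp_slope_2, lo=1e-9, hi=1e-2):
    """tau from phase at OMEGA_1 and amplitude at OMEGA_2, by bisection on a log scale."""
    target = (phase_slope_1 / amp_slope_2) ** 2

    def ratio(tau):
        return ((math.sqrt(1 + (OMEGA_1 * tau) ** 2) - 1)
                / (math.sqrt(1 + (OMEGA_2 * tau) ** 2) + 1))

    for _ in range(200):            # ratio(tau) increases monotonically with tau
        mid = math.sqrt(lo * hi)
        if ratio(mid) < target:
            lo = mid
        else:
            hi = mid
    return math.sqrt(lo * hi)


def D_from_tau_and_phase(tau, phase_slope_1):
    """Second value of D once tau is known, from the phase slope at OMEGA_1."""
    return (math.sqrt(1 + (OMEGA_1 * tau) ** 2) - 1) / (2.0 * tau * phase_slope_1 ** 2)


# Round trip with trial values: D = 49 cm^2/sec, tau = 20 microseconds.
amp1, phase1 = slopes(49.0, 20e-6, OMEGA_1)
amp2, _ = slopes(49.0, 20e-6, OMEGA_2)
D_23 = D_from_amplitude_and_phase(amp1, phase1)
tau_13 = tau_from_slope_ratio(phase1, amp2)
D_13 = D_from_tau_and_phase(tau_13, phase1)
print(D_23, tau_13, D_13)
```

For ideal data the two values of D coincide, so a difference between them in practice is a measure of the internal consistency of the three measurements.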

In the present experiments a conventional travelling light spot equip- 
ment was used, the signal being fed to a phase-sensitive bridge. A phase 
reference signal for the bridge was derived from a phototransistor in the 
light beam, which was chopped mechanically. This arrangement ensured 
that the reference and signal frequencies were coherent regardless of drift 
in chopper speed. 

A schematic diagram of the phase-sensitive bridge and associated 
circuitry is given in fig. 1. The bridge is balanced by adjustment of the 
phase of the incoming signal. In a phase-shift circuit of the type shown 
the phase angle is given by 

$$\tan\tfrac{1}{2}\theta = -\omega CR.$$

With a high-μ triode, a maximum phase-shift of about 140° is obtained.
For the measurement of signal amplitude the phase is adjusted to give
a maximum reading on the output meter but for the measurement of
phase angle considerably greater sensitivity can be obtained by adjusting
the signal phase to give zero deflection on the meter.
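
As a small numerical illustration of the phase-shift relation tan ½θ = −ωCR, the sketch below evaluates the phase angle for given components and the resistance needed for a chosen shift. The capacitance of 0.01 μF is a hypothetical value chosen for illustration, not one taken from fig. 1.

```python
import math


def phase_shift_degrees(freq_cps, capacitance_F, resistance_ohm):
    """Phase angle theta in degrees, from tan(theta / 2) = -omega * C * R."""
    omega = 2.0 * math.pi * freq_cps
    return math.degrees(2.0 * math.atan(-omega * capacitance_F * resistance_ohm))


def resistance_for_shift(freq_cps, capacitance_F, shift_degrees):
    """Resistance giving a phase shift of the stated magnitude (less than 180 degrees)."""
    omega = 2.0 * math.pi * freq_cps
    return math.tan(math.radians(abs(shift_degrees)) / 2.0) / (omega * capacitance_F)


# Hypothetical capacitor of 0.01 microfarad at the 4 kc/s working frequency.
R = resistance_for_shift(4000.0, 1e-8, 140.0)
print(f"R for a 140 degree shift = {R / 1000:.1f} kilohm")
print(f"check: theta = {phase_shift_degrees(4000.0, 1e-8, R):.1f} degrees")
```

The shift approaches 180° only as ωCR becomes very large, which is consistent with a practical maximum somewhat below that figure.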


3.2. The Travelling Light Spot : Results 

Some of the results obtained by the travelling light spot method on a 
number of N- and P-type germanium crystals are summarized in table 2. 
Figures 2 and 3 show some phase and amplitude measurements as a 
function of distance from the detector contact for an N- and P-type 
crystal respectively. From these results the following generalizations
may be made.

(1) In N-type crystals $D_\perp$ is equal to the normal value of D but $D_\parallel$
is significantly greater. This result is in agreement with the drift mobility
data. The apparent carrier lifetime is also generally greater parallel to
the dislocations than across them, though this is not always so.

(2) In P-type crystals the values of $D_\parallel$ and $D_\perp$ are not significantly
different from the normal value for electrons and all the anisotropy in
diffusion length is due to anisotropy in τ.
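
The link between these quantities and the anisotropy of diffusion length can be made explicit with L = (Dτ)^{1/2}. The sketch below uses the first N-type row of table 2 and assumes that the tabulated lifetimes are in microseconds; the function name is illustrative.

```python
import math


def diffusion_length_cm(D_cm2_per_sec, lifetime_microsec):
    """Diffusion length L = sqrt(D * tau); the lifetime unit is an assumption."""
    return math.sqrt(D_cm2_per_sec * lifetime_microsec * 1e-6)


# First N-type row of table 2: D(2,3) and tau(1,3), parallel and across.
L_parallel = diffusion_length_cm(116.0, 19.3)
L_across = diffusion_length_cm(50.0, 10.0)
print(f"L parallel = {L_parallel:.4f} cm")
print(f"L across   = {L_across:.4f} cm")
print(f"anisotropy = {L_parallel / L_across:.2f}")
```

For this row the anisotropy of diffusion length is a little over 2, with the larger D and the longer lifetime parallel to the dislocations both contributing.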


Fig. 1

[Block diagram. Labels: signal in; phase shift circuit; amplifier; 2 stage
wide band amplifier; phase reference signal from photo-transistor; H.T.+]

Circuit diagram of phase sensitive bridge.


No significance may be attached to the actual values of $D_\parallel$ obtained
in the N-type crystals. The crystals used were markedly non-uniform 
in dislocation density and only regions showing marked anisotropy in 
diffusion length were selected for detailed examination. Fortunately 
crystal uniformity is only required over distances ~1mm in this 
experiment, compared with 7-8mm in a drift mobility experiment. 
Short range non-uniformity, indicated by non-linearity of the phase and
amplitude curves, was observed occasionally, however, particularly in the
⊥-direction. That these anomalies were a feature of the crystal could be
shown by moving the detector contact and remeasuring, when the
anomaly reappeared in the same position of the light spot but at a
different radius. As the theory upon which eqns. (2), (3) and (4) are based
cannot be applied unless ∂θ/∂r and ∂ ln(Sr/r₀)/∂r are single-valued,
measurements of this type were excluded from the results.


Table 2. Values of D and τ deduced from (1) Amplitude at 800 c/s ;
(2) Amplitude at 4 kc/s ; (3) Phase at 4 kc/s


N-type crystals

      Parallel to dislocations                 Across dislocations
  D(2,3)       D(1,3)      τ(1,3)         D(2,3)       D(1,3)      τ(1,3)
cm² sec⁻¹    cm² sec⁻¹     μsec         cm² sec⁻¹    cm² sec⁻¹     μsec

   116         130         19.3            50           49         10.0
    87          82         21              43           41         12.7
    89     [illegible]     33              46           41         33
   109         105         13              -            -          -
   110         110         48              78           55         11
    81     [illegible]     22      without background light
   102          86         14      with background light
[illegible]     66         24      without background light
    33          82         14      with background light


P-type crystals

  D(2,3)       D(1,3)        τ            D(2,3)       D(1,3)        τ

    88          93         55              99           79         22
    89          87         69             100           84     [illegible]

§ 4. THE MEASUREMENT OF THE DRIFT MOBILITY OF HOLES AT HIGH
ELECTRIC FIELDS

4.1. High Field Drift Mobility : Experimental

To measure the drift mobility of holes at high fields in N-type filaments 
cut parallel to and across the dislocations the technique described by 
Gibson and Granville (1956) was used. Precautions were taken to ensure 
that : 

(1) The injected hole density was kept low enough to reduce con- 
ductivity modulation effects to negligible proportions. 

(2) Only regions of constant crystal resistivity were selected for 
examination. 


(3) The non-uniformity of carrier lifetime along the filaments, due to 
variations in dislocation density, did not influence the results significantly. 


Fig. 2

[Two panels: direction across dislocations; direction parallel to
dislocations. Horizontal axis: distance in centimetres from point.]

Signal amplitude and phase as a function of distance from detector
contact on N-type germanium.

Curve A. Phase at 4 kc/s.
Curve B. Amplitude at 800 c/s.
Curve C. Amplitude at 4 kc/s.


Fig. 3

[Two panels: direction across dislocations; direction parallel to
dislocations. Vertical axes: phase angle in radians and log_e (signal
amplitude × distance). Horizontal axis: distance in centimetres from point.]

Signal amplitude and phase as a function of distance from detector
contact on P-type germanium.

Curve A. Phase at 4 kc/s.
Curve B. Amplitude at 800 c/s.
Curve C. Amplitude at 4 kc/s.


4.2. High Field Drift Mobility : Results 

It was found that, in N-type germanium, the drift mobility of holes
parallel to the dislocations was the same as in undislocated material up
to 3000 volt cm⁻¹. The drift mobility across the dislocations, on the
other hand, was significantly less than the normal value for fields greater
than about 100 volt cm⁻¹. Some typical results are shown in fig. 4. The
initial slope corresponds to a drift mobility of 1900 cm² volt⁻¹ sec⁻¹,
which is a reasonable value for high resistivity material (~30 ohm cm)
and in agreement with the results obtained by the conventional drift
experiment (§ 2).

Fig. 4

[Graph. Vertical axis: hole drift velocity in cm sec⁻¹ × 10⁻⁵. Horizontal
axis: applied field in volts cm⁻¹, 0 to 1400.]

Drift velocity of holes in dislocated N-type germanium as a function of
applied electric field.

Curve A. Parallel to the dislocation array.
Curve B. Across the dislocation array.


The mobility of the majority carriers can be deduced from current/
voltage characteristics of the filaments and no unusual behaviour in
either N- or P-type samples was observed up to 4 kV cm⁻¹.


§ 5. DISCUSSION OF RESULTS


Shockley (1953) has suggested, on theoretical grounds, that edge 
dislocations act as a row of acceptor levels due to the dangling bonds. 
This model has been applied successfully to interpret the Hall effect in 
dislocated germanium (Pearson et al. 1954) and the variation of carrier
lifetime with temperature (Wertheim and Pearson 1957). If a significant
fraction of the acceptor levels are occupied the potential energy of
electrons is raised locally and the dislocation may be considered as a
thread of P-type material embedded in the crystal. The application of 
the model to the present results will be described quantitatively in a 
subsequent paper (Gibson and Paige 1958), but a qualitative interpretation 
of the data will now be given. 


5.1. N-type Germanium 

Threads of P-type material in an N-type bulk crystal will be surrounded 
by a space-charge region or P-N junction. The junction may be
represented by a barrier capacitance and conductance in parallel. The
dislocation is therefore analogous to a coaxial line formed by an inner
P-type conductor and outer N-type conductor. As a P-type region
represents a potential minimum for holes, injected carriers will be captured
by the dislocations where they will become majority carriers. If the
coaxial line was lossless a space-charge signal would propagate down the
line with a velocity near that of light and a hole would be emitted anywhere
along the line to compensate for the added charge†. In practice the line
is lossy and a hole will be emitted a finite time later at a finite average 
distance from the point of entry. A captured hole may move in either 
direction along the dislocation, so that, on the average, its mean position 
is unaffected by capture in a dislocation ‘thread’. It follows therefore 
that dislocations provide a mechanism for enhanced diffusion of holes 
along the dislocations without any corresponding increase in hole 
mobility, so that eqn. (1) is not applicable to this system. Clearly the 
dislocations cannot assist the diffusion of holes perpendicular to the array. 

As already stated, a hole captured in a dislocation thread will, in the 
absence of an electric field, leave on the average at the point of entry. 
If an electric field is applied along the dislocation the holes will drift at 
a velocity determined by the field and their mobility, as in any conductor. 
It is now convenient to change the frame of reference so that the holes 
are stationary and the dislocation is moving in the opposite direction. An 
injected hole, entering the dislocation thread, will now leave, on the 
average, a distance μEt further down the dislocation. Capture in a
dislocation thread, therefore, will have no effect on the drift mobility in 
this direction. 

When the electric field is directed across the dislocations a very similar
argument may be applied. In the absence of an external field a hole will
enter and leave, on the average, at the same point in the cross direction.
If the frame of reference is moving in the presence of a field, the hole will
behave as if it were drifting at the normal rate. This argument will be
valid provided that the distance moved by the dislocation in the time
for which the hole is trapped is less than the radius of the thread. This
condition cannot be expected to apply at high electric fields and provides
the basis for interpreting the data given in § 4.

† This result is essentially the same as that obtained by Moore and Webster
(1955) for the propagation of a hole through a 'floating' alloyed P region on
N material.

Apart from the major features of dislocated N-type material discussed 
above it is possible to fit some of the relatively minor features into the 
model. For example, it may be expected that the density of holes at the 
dislocation threads in equilibrium will be increased by background 
illumination, with a resultant increase in conductivity of the inner P-type 
conductor and increased effective diffusion constant associated with the 
dislocation thread (table 2). It is also of interest to note that the high 
diffusion rate of holes along the dislocations provides an additional 
mechanism to that suggested by Bell and Hogarth (1957) for enhanced 
surface recombination in filaments cut across the dislocations. 


5.2. P-type Germanium 


In P-type material the dislocations are, of course, still P-type. However, 
the negative charge density along the dislocation may be significantly 
higher than the acceptor density in the high resistivity bulk material so 
there is still a space-charge region in which the electron potential energy 
is increased. The presence of this barrier will impede the capture of 
minority carriers (electrons) by the dislocations and, following the 
argument given by Bell and Hogarth, provides a mechanism by which τ
may be anisotropic. The dislocations will clearly have no significant 
effect on the diffusion constant of the electrons, however, and no 
anisotropy of D can be expected. 


ACKNOWLEDGMENTS 


We are indebted to our colleagues for their advice and help and for the 
supply of suitable crystals. The paper is published by permission of the 
Controller, H.M. Stationery Office. 


REFERENCES 


Avery, D. G., and Gunn, J. B., 1955, Proc. phys. Soc. Lond. B, 68, 918.
Bell, R. L., and Hogarth, C. A., 1957, J. Electron. and Control, 3, 455.
Gibson, A. F., and Granville, J. W., 1956, J. Electron., 2, 259.
Gibson, A. F., and Paige, E. G. S., 1958, Phil. Mag., 3, 950.
Lawrance, R., and Gibson, A. F., 1952, Proc. phys. Soc. Lond. B, 65, 994.
Moore, A. R., and Webster, W. M., 1955, Proc. Inst. Radio Engrs, N.Y., 43, 427.
Pearson, G. L., Read, W. T., and Morin, F. J., 1954, Phys. Rev., 93, 666.
Shockley, W., 1953, Phys. Rev., 91, 228.
Wertheim, G. K., and Pearson, G. L., 1957, Phys. Rev., 107, 694.




An Interpretation of certain Transport Properties in Germanium
containing Parallel Arrays of Edge Dislocations†


By A. F. Gibson and E. G. S. Paige


Royal Radar Establishment, Malvern
[Received April 28, 1958]


ABSTRACT 


An interpretation is given of the anisotropic effects observed by Arthur 
et al. (1958) in germanium containing parallel arrays of edge dislocations. 
The anisotropy of the diffusion constant and high field mobility in N-type 
crystals is considered quantitatively. The diameter of the space-charge
cylinder surrounding the dislocations (1·6 × 10⁻⁴ cm) and the fraction of
time an injected hole spends within the space-charge region (½) are deduced
from the analysis.


§ 1. INTRODUCTION 


It has been established that edge dislocations in germanium act as 
acceptor centres (Pearson et al. 1954) and that they are efficient sites for 
recombination (Wertheim and Pearson 1957). Shockley (1953) has 
accounted for the acceptor type behaviour in terms of the dangling bonds 
associated with an edge dislocation. Read (1954) has presented a model 
of an edge dislocation in N-type germanium. He calculates that up to 
about 10% of the available sites can be occupied by electrons, the electro- 
static charge on the dislocation being neutralized by a cylindrical 
space-charge region. The limitation on the occupancy is imposed by 
the coulomb interaction energy set up between adjacent electrons on the 
dislocation. 

Observations on the lifetime of minority carriers in dislocated material 
have not led to a more detailed model of the dislocation and its effect
on the surrounding medium (Wertheim and Pearson 1957). However, 
recently Bell and Hogarth (1957), using crystals containing parallel 
arrays of edge dislocations, have found that the diffusion length is 
anisotropic in both N- and P-type germanium. Further investigations 
of this effect together with observations on the mobility at high fields 
have been made by Arthur et al. (1958). A summary of the latter’s
experimental observations will be given in §2. In the remainder of this 
paper a model will be presented which accounts for the anisotropic effects. 
In particular, a quantitative analysis is given of the anisotropy of 
diffusion constant found in dislocated N-type crystals. Certain para- 
meters of the model are calculated from the experimental data of Arthur 
et al. (1958). 


† Communicated by the Authors.


§ 2. SUMMARY OF EXPERIMENTAL OBSERVATIONS 


The model, which we shall consider quantitatively for N-type 
germanium, has been constructed to explain the observations made by 
Arthur et al. (1958). A summary of the effects observed in crystals
containing parallel arrays of edge dislocations follows. 

For N-type germanium : 


(1) The diffusion constant of holes parallel to the dislocations is greater 
than that measured perpendicular. The latter has the normal value. 

(2) The lifetime of holes is not usually isotropic with direction of 
diffusion. 

(3) The apparent diffusion constant and lifetime of holes parallel to 
the dislocations increase and decrease respectively in the presence of 
background illumination. 


(4) At low fields the drift mobility of holes is isotropic and has the 
normal value. At fields in excess of 100 v cm⁻¹ the mobility perpendicular
to the dislocations becomes significantly less than that parallel to the 
dislocations—the latter has the usual dependence on field. 


(5) The conductivity is isotropic in the same specimens at all fields. 


For P-type germanium the diffusion constant of electrons is isotropic 
but the lifetime is greater for diffusion parallel to the dislocations than 
across them. The conductivity is again isotropic at all fields. 

The magnitude of these effects varied from point to point in both N- and 
P-type crystals. 


§ 3. THE PROPOSED MODEL


Two major difficulties in analysing the electrical effects of dislocations 
are that (a) the concentration and nature of impurities segregated at 
dislocations are at present unknown, and (b) the density of dislocations
is not uniform and they may be polygonized into ‘walls’. The variation
in magnitude of the anisotropic effects is probably related to the
variations in dislocation density. Evidence that impurities segregated at 
dislocations are not playing a vital role in the anisotropic effects is that 
anisotropy was observed by Bell and Hogarth (1957) in both plastically 
deformed and grown dislocated crystals. It is most unlikely that 
dislocations formed under these two conditions would have the same 
impurity atmospheres. The model proposed, therefore, is essentially the 
same as that considered by Read (1954). However, it is found necessary 
to modify his model in certain respects and in the remainder of this 
section these modifications are presented. 

A qualitative discussion of the anisotropic effects using the proposed
model has been given by Arthur et al. (1958).


3.1. The Free Hole Density 


The electrical effects are considered to arise from the trapping of 
electrons at dangling bonds present at an edge dislocation; these traps
are hereafter called dislocation sites. The trapped electrons raise the
potential energy of electrons in the vicinity of the dislocation and increase 
the free hole density above that of the bulk. In high resistivity N-type 
germanium the material becomes P-type in a cylindrical volume around 
the dislocation, which we call the inversion cylinder. Even in P-type 
material sufficient electrons can be trapped at dislocation sites to 
significantly increase the local hole density, forming a P+ cylinder in 
the P-type crystal. 

Read (1954) considered the electrical effects of dislocations in low 
resistivity N-type germanium. He is able, therefore, to neglect the 
contribution of free holes to the space-charge surrounding the trapped 
electrons at sufficiently low temperatures. Since we are concerned with 
high resistivity (20 Ω cm) material at room temperature we modify Read’s
model to include a significant free hole contribution to the space-charge 
region. As will become apparent, the P-type inversion cylinder is vital 
to the analysis of the anisotropic effects. Direct experimental evidence for 
the existence of inversion cylinders in one of the crystals used by Arthur 
et al. (1958) has been presented by Hogarth and Baynham (1958). They 
have observed P-type rectification at ‘ walls ’ formed by the polygonization 
of dislocations in N-type germanium. They also found a significant 
lowering of the conductivity near dislocations in P-type material. Tweet 
(1955) has found relatively high P-type conductivity at the grain boundary 
in both P- and N-type gold doped bicrystals at low temperatures. The 
mobility of these holes was not so low as to indicate that at room 
temperature it would differ appreciably from the normal value. 


3.2. Energy Level of the Dislocation Site 


To provide a mechanism for the anisotropy of lifetime in P-type 
dislocated crystals Bell and Hogarth (1957) assumed there was a potential 
barrier to minority carriers surrounding the dislocation. In the proposed 
model this is formed by the cylindrical PP+ junction. The barrier height 
will depend on the ratio of the mean separation between ionized acceptors 
to the average distance between electrons trapped at dislocation sites. 
As a criterion for an effective barrier we take this ratio to be equal to or 
greater than unity. Since anisotropic effects have been observed in 
12 Ω cm P-type germanium we can use this criterion to estimate an upper
limit to ε_d, the energy of the dislocation site relative to the top of the
valence band. Fermi statistics yield an upper limit for ε_d of 0·4 ev. The
use of Fermi statistics will tend to overestimate ε_d but because of the
small fraction of dislocation sites occupied the approximation will be
fairly good. This is significantly less than the value of 0·5 ev used by
Read (1954) in his calculations. The Hall constant and resistivity
measurements on plastically deformed 15 Ω cm N-type germanium made
by Pearson et al. (1954) were consistent with ε_d = 0·5 ev. An interpretation
of the variation of lifetime with temperature in 2 Ω cm P-type germanium
by Wertheim and Pearson (1957) is consistent with the 0·4 ev or smaller
value of ε_d. Unfortunately, because the dislocation density is about
10⁶ cm⁻², no information regarding ε_d would be gained from Hall constant
measurements on the crystals used by Bell and Hogarth (1957) and
Arthur et al. (1958). In the absence of more precise data we take ε_d to
be about 0·4 ev in our crystals, and, as previously stated, we do not
believe this is associated with a particular impurity atmosphere. It is
also significant that it will be found necessary to use this value of ε_d to
account for the apparent anisotropy of diffusion constant in N-type
germanium.


§ 4. CHARACTERISTICS OF THE INVERSION CYLINDER
IN N-TYPE GERMANIUM


In discussing the anisotropy of the diffusion constant in N-type crystals 
Arthur et al. (1958) suggested that the P-type cylinder in N-type material
resembled a lossy transmission line. Space-charge signals due to 
fluctuations in the hole concentration in the inversion cylinder would be 
propagated along the cylinder with a change of amplitude and phase. 
In this section we determine these two quantities in terms of characteristics 
of the line, leaving their relationship to experimental observation till § 5. 

Fig. 1. Element of transmission line.

The coaxial transmission line will be formed from iterative elements of
the form shown in fig. 1. In the figure, R is the resistance per unit length
of the inner P-type conductor which has a radius r_p. Between the radii r_p
and r_j is the relatively carrier free region of the PN junction having a
barrier capacitance and conductance per unit length of C and G
respectively. Outside r_j the space-charge is zero and the material
undisturbed N-type. For such a line the propagation equation of a
signal S will be

    S = S_0 \exp\{-l\sqrt{R/Z}\} = S_0 \exp[-l(\alpha + i\beta)], \qquad (1)

where Z = (G + iωC)⁻¹; ω is the frequency of the fluctuating signal. The
attenuation, α, and the phase shift, β, per unit length, obtained from
eqn. (1) are

    -\frac{d(\ln S)}{dl} = \alpha = \left(\frac{RG}{2}\right)^{1/2}\left[\left(1+\frac{\omega^2C^2}{G^2}\right)^{1/2}+1\right]^{1/2}, \qquad (2)

    \frac{d\theta}{dl} = \beta = \left(\frac{RG}{2}\right)^{1/2}\left[\left(1+\frac{\omega^2C^2}{G^2}\right)^{1/2}-1\right]^{1/2}, \qquad (3)

where θ = βl has been introduced for convenience.
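The closed forms of eqns. (2) and (3) can be checked against the complex propagation constant of eqn. (1). The following is a minimal sketch only: the numerical values of R, G, C and ω are illustrative placeholders chosen for the check, not values taken from the paper, and the function names are hypothetical.

```python
import cmath
import math


def line_constants(R, G, C, omega):
    """Attenuation and phase shift per unit length from the closed forms (2), (3)."""
    ratio = math.sqrt(1.0 + (omega * C / G) ** 2)
    prefactor = math.sqrt(R * G / 2.0)
    alpha = prefactor * math.sqrt(ratio + 1.0)
    beta = prefactor * math.sqrt(ratio - 1.0)
    return alpha, beta


def propagation_constant(R, G, C, omega):
    """Complex constant sqrt(R/Z) of eqn. (1), with Z = 1/(G + i*omega*C)."""
    return cmath.sqrt(R * (G + 1j * omega * C))


# Illustrative placeholder values (assumed, not from the paper).
R, G, C, omega = 1.2e10, 1.0e-3, 5.0e-13, 2.0e9

alpha, beta = line_constants(R, G, C, omega)
gamma = propagation_constant(R, G, C, omega)

# The real part of sqrt(R/Z) is the attenuation, the imaginary part the phase shift.
print(alpha, gamma.real)
print(beta, gamma.imag)
```

The two routes agree because α² − β² = RG and 2αβ = RωC, which are the real and imaginary parts of R(G + iωC).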

It would now appear that we have to calculate C, G and R. In practice
it is only necessary to calculate R and G_p, the hole conductance of the
barrier. This is particularly fortunate since R and G_p are determined by
the properties of the inner conductor and the undisturbed N-type region
respectively. Compared with C, R and G_p are relatively insensitive to the
shape and potential distribution in the junction. The barrier conductance
per unit length for a small bias will be given by

    G = G_p + G_n = (I_p + I_n)\,\frac{q}{kT}. \qquad (4)

Here I_p and I_n are the saturation currents per unit length of the
junction for holes and electrons respectively. It will become apparent in
§ 5 that we are only interested in the hole current. The calculation of
I_p has been performed in the Appendix; from the result we obtain a
hole conductance per unit length of

    G_p = 2\pi q \mu_p p_n [\ln(2L/r_j) - \gamma]^{-1}. \qquad (5)

μ_p is the normal hole mobility, p_n the equilibrium density of holes in the
N-type crystal, and L is the diffusion length of holes.

The resistance per unit length of the inversion cylinder may be estimated
as follows. If a fraction f of dislocation sites are occupied and the
separation between sites is a, then a negative charge of fq/a resides along
unit length of the edge dislocation. For charge neutrality,

    -\frac{fq}{a} + Pq + \pi r_j^2 (N_D - N_A) q = 0, \qquad (6)

where P is the number of holes per unit length of the dislocation and
(N_D − N_A) is the density of uncompensated donors. For a hole mobility
of μ_pd along the cylinder, the resistance per unit length becomes

    R = a\{[f - a\pi r_j^2 (N_D - N_A)]\, q \mu_{pd}\}^{-1}. \qquad (7)


§ 5. THE RELATIONSHIP BETWEEN THE EXPERIMENTAL
OBSERVATIONS AND THE PARAMETERS OF THE INVERSION CYLINDER


The travelling light spot and drift mobility experiments of Arthur
et al. (1958) will be compared with parameters of the inversion cylinder.
Then, using the experimental data from the mobility experiments, a value
of the apparent diffusion constant in the inversion cylinder together with
the fraction of time spent by an injected hole in the cylinder will be found.
These will then be compared with the observed diffusion constants.


5.1. The Travelling Light Spot Experiment 


The technique used by Arthur et al. (1958) which enabled a simultaneous
measurement of diffusion constant, D, and lifetime, τ, to be made has
been described by Avery and Gunn (1955). The equations from which D
and τ can be deduced are

    -\frac{d[\ln(rS/r_0)]}{dr} = (2D\tau)^{-1/2}\{(1+\omega^2\tau^2)^{1/2}+1\}^{1/2} \qquad (8)

and

    \frac{d\theta}{dr} = (2D\tau)^{-1/2}\{(1+\omega^2\tau^2)^{1/2}-1\}^{1/2}, \qquad (9)

where ω is the modulation frequency of the light spot. These two
equations are identical in form to eqns. (2) and (3). The replacement of
d/dl(ln S) by d/dr[ln(rS/r_0)] occurs because eqns. (2) and (3) are essentially
one dimensional. In practice the variation of S with r is so rapid that
d(ln S)/dl and d/dr[ln(Sr/r_0)] are very nearly equal. We may therefore
define

    D_c = (RC)^{-1} \quad\text{and}\quad t_c = CG^{-1}

as the effective diffusion constant and release time of a hole in an inversion
cylinder. The quantity t_c should not be confused with a carrier lifetime;
it is the average time a hole spends within a cylinder. The length √(D_c t_c)
represents the mean range of a hole along the cylinder.

Two processes limit t_c: recombination at the dislocation sites and
escape to the undisturbed N-type material. Thus the probability per
unit time that an excess hole is removed from the inversion cylinder is

    \frac{1}{t_c} = \frac{1}{t_r} + \frac{1}{t_e}, \qquad (10)

where t_r and t_e are average times spent in the inversion cylinder before
recombination and escape respectively. Suppose a group of holes have
entered the inversion cylinder. Due to their presence the PN junction is
forward biased and some holes will be ejected from and some electrons
will flow into the P-region. The former process will be determined by the
hole conductance of the barrier, the latter by the electron conductance.
Accordingly we write

    t_e = C/G_p \quad\text{and}\quad t_r = C/G_n. \qquad (11)

The fraction of time spent by a hole in an inversion cylinder, η, is
simply t_c(t_c + t_f)⁻¹ where t_f, the mean time between escape from and
re-entering the cylinder, is given by

eon lr. VE - Up) Pho etna ce (eb) 

Here ρ_d is the density of dislocations and v_p is the thermal velocity of the
hole.


We may define a minority carrier lifetime in terms of t_r by the equation

    \tau = t_r/\eta. \qquad (13)


5.2. The Drift Mobility Experiment


A characteristic time, t_c, has been associated with the average interval
between entering and leaving an inversion cylinder. We may therefore
look upon the inversion cylinder as a trap of large physical dimensions.
If the electric field is perpendicular to the dislocation array and the drift
velocity, v, such that v < ½(2r_j/t_c), then trapping effects will not be
noticeable. However, for fields in excess of E_c, given by

    E_c = \frac{1}{2\mu_p}\left(\frac{2r_j}{t_c}\right), \qquad (14)

trapping will lower the apparent mobility. Under these conditions the
apparent drift velocity v_a will be

    v_a = (r_j/t_c)\,\eta' + v_0(1-\eta'), \qquad (15)

where v_0 is the normal drift velocity at the field considered and
η′ = t_c(t_c + t_f)⁻¹. At sufficiently high fields, such that E r_j ≳ kT/q, both
r_j and t_c will become field dependent in such a way as to tend to increase
v_a to the normal value.

Fig. 2. Variation of the drift velocity perpendicular to the dislocation array (v_a) with
the normal drift velocity (v_0) in the same electric field. Both axes are in cm sec⁻¹.

In fig. 2 v_a has been plotted against v_0 and the three drift velocity
regions can be seen. As predicted, at the lowest fields there is a linear
region of slope unity; at higher fields the plot is linear and of slope
(1 − η′); at the highest fields, the departure from this slope is observed
due to changes in r_j and t_c.

For an electric field parallel to the dislocation array, each dislocation
considered to be of infinite length, the apparent mobility, μ_a, is

    \mu_a = \mu_{pd}\,\eta' + \mu_p(1-\eta'). \qquad (16)


5.3. Parameters Derived from the Experimental Results 

By comparing fig. 2 with eqns. (14) and (15) we obtain 1 − η′ = 0·45,
r_j/t_c = 1·7 × 10⁵ cm sec⁻¹ and (r_j/t_c)η′ = 0·9 × 10⁵ cm sec⁻¹. The two values
of η′ are consistent and approximately 0·5; thus, from the definition of
η′, t_c and t_f must be about equal. From etch pit counts ρ_d was found to be
approximately 1 × 10⁶ cm⁻². Inserting this value in eqn. (12), r_j t_f (= r_j t_c)
can be calculated. From r_j t_c and r_j t_c⁻¹ we obtain r_j = 0·8 × 10⁻⁴ cm and
t_c = t_f = 5 × 10⁻¹⁰ sec. The mobility measured parallel to the dislocations
has the same value as in relatively dislocation free material at all fields.
Because η′ is a sufficiently large fraction this result enables us to deduce
from eqn. (16) that μ_pd ≈ μ_p.
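The consistency of the figures read from fig. 2 can be retraced with a few lines of arithmetic. This is a sketch under stated assumptions: the hole mobility of 1900 cm² v⁻¹ sec⁻¹ is an assumed textbook value for germanium that the text does not quote, the lifetime of 10 μsec is the typical value mentioned in § 6, and all variable names are illustrative.

```python
MU_P = 1900.0            # hole mobility, cm^2/(V s); assumed, not quoted in the text

slope = 0.45             # 1 - eta', slope of the second linear region of fig. 2
knee_velocity = 1.7e5    # r_j / t_c in cm/s
intercept = 0.9e5        # (r_j / t_c) * eta' in cm/s
r_j = 0.8e-4             # radius of the space-charge cylinder, cm
lifetime = 10e-6         # typical measured lifetime, s (from the conclusion)

# Two independent estimates of the trapped fraction eta'.
eta_from_slope = 1.0 - slope
eta_from_intercept = intercept / knee_velocity

# Release time, and the field at which the drift velocity reaches r_j / t_c.
t_c = r_j / knee_velocity
critical_field = knee_velocity / MU_P    # V/cm

# With t_f taken equal to t_c, the number of capture and release cycles
# a hole undergoes during its lifetime.
t_f = t_c
cycles = lifetime / (t_c + t_f)

print(eta_from_slope, eta_from_intercept)
print(t_c, critical_field, cycles)
```

Under these assumptions both estimates of η′ come out near 0·5, t_c is close to 5 × 10⁻¹⁰ sec, the critical field is a little under 100 v cm⁻¹, and a hole enters and leaves a cylinder of order 10⁴ times in its lifetime.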

A value of D_c will be deduced now using the parameters that have
been determined for the drift mobility experiments. The resistivity of
the crystal used is about 20 Ω cm, hence (N_D − N_A) is 1 × 10¹⁴ cm⁻³,
p_n is 3 × 10¹² cm⁻³ and the position of the Fermi level is 0·42 ev above the
valence band. The hole conductance of the barrier is evaluated by
substituting the typical value of L, 2 × 10⁻² cm, in eqn. (5). G_p is not
sensitive to values of r_j and L provided L ≫ r_j, as is the case. Using the
value of ε_d suggested in § 3, f is estimated to be close to the limiting value
of 0·1. We can now calculate R from eqn. (7) remembering the separation
between dislocation sites is 4 Å for an edge dislocation and that equating
μ_pd to μ_p has been justified. We find R = 1·2 × 10¹⁰ Ω cm⁻¹. Hence a
value of the effective diffusion constant in the inversion cylinders is
obtained


From eqn. (13), and the observed lifetime, it is readily shown that
t_r ≫ t_e. Also it has been found that injected holes spend about half their
life in inversion cylinders. Without a rigorous calculation it is clear that
the observed value of the diffusion constant parallel to the dislocations
will be intermediate between D_c and the normal value. Perpendicular
to the dislocations the inversion cylinders will not affect the diffusion
process and the normal value of D, 47 cm² sec⁻¹, is to be expected. Typical
results presented by Arthur et al. (1958) are:


D parallel to dislocations ~ 100 cm² sec⁻¹,


D perpendicular to dislocations ~ 45 cm² sec⁻¹.
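The chain of estimates in § 5.3 can be sketched numerically. This is an illustration under assumptions, not the authors' own calculation: the hole mobility of 1900 cm² v⁻¹ sec⁻¹ is an assumed textbook value, the capacitance is eliminated by taking C ≈ G_p t_c (reasonable only if escape dominates the release time), and the function names are hypothetical. The remaining inputs are the values quoted in the text.

```python
import math

Q = 1.602e-19          # electronic charge, coulomb
EULER_GAMMA = 0.5772   # Euler's constant (rounded to 0.58 in the Appendix)
MU_P = 1900.0          # hole mobility, cm^2/(V s); assumed, not quoted in the text

# Values quoted in the text.
p_n = 3e12             # equilibrium hole density in the N-type crystal, cm^-3
L = 2e-2               # diffusion length of holes, cm
r_j = 0.8e-4           # radius of the space-charge cylinder, cm
t_c = 5e-10            # release time, s
R_quoted = 1.2e10      # resistance per unit length, ohm/cm
f = 0.1                # fraction of dislocation sites occupied
a = 4e-8               # separation between dislocation sites, cm
donors = 1e14          # uncompensated donor density, cm^-3


def hole_conductance(mu_p, p_n, L, r_j):
    """Hole conductance per unit length of the barrier, eqn. (5)."""
    return 2.0 * math.pi * Q * mu_p * p_n / (math.log(2.0 * L / r_j) - EULER_GAMMA)


def line_resistance(f, a, r_j, donors, mu_pd):
    """Resistance per unit length of the inner conductor, eqns. (6) and (7)."""
    holes_per_cm = f / a - math.pi * r_j ** 2 * donors
    return 1.0 / (holes_per_cm * Q * mu_pd)


G_p = hole_conductance(MU_P, p_n, L, r_j)
R_est = line_resistance(f, a, r_j, donors, MU_P)

# D_c = 1/(R C) with C ~ G_p * t_c (assumption: escape dominates release).
D_c = 1.0 / (R_quoted * G_p * t_c)

print(G_p, R_est, D_c)
```

With these inputs the sketch gives G_p of about 1 × 10⁻³ Ω⁻¹ cm⁻¹ and D_c of roughly 160 cm² sec⁻¹, which lies above the observed parallel value as the argument requires. The rounded inputs give R of the same order as the quoted figure; R is the reciprocal of a small difference between two nearly equal terms, so it is sensitive to the values chosen for f and the donor density.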




If a value of ε_d of 0·5 ev is taken instead of 0·4 ev, f falls appreciably
and D_c → 0. It is important to notice that the observed value of D will
be sensitive to the dislocation density since this determines η′. This is
the basis for the statement in § 3, that the variation of the anisotropic
effects from point to point in the crystal was due to ρ_d.


§ 6. CONCLUSION 


Arthur et al. (1958) have shown that an anisotropy of the diffusion 
length was due to an anisotropy in τ in P-type crystals while an anisotropy
in D was always found in N-type crystals. The model proposed in §3 
has been used to give a qualitative explanation of the former 
and a quantitative explanation of the latter. It does not provide a 
mechanism for an anisotropic D in P-type crystals, since the transmission 
line analogy is no longer valid, and indeed no anisotropy of D is 
observed. 

An explanation of the drift mobility data of holes in N-type specimens 
has led to a calculation of the diameter of the space-charge cylinder 
surrounding the dislocation. The value of 1·6 × 10⁻⁴ cm for the diameter
is not inconsistent with the observations of Hogarth and Baynham (1958)
since they probed across dislocations which had been polygonized into a
‘wall’. From R, r_j and ρ_d the conduction of the inner P-type conductors
parallel to the dislocations can be calculated. It is found that no 
significant enhancement of the conductivity parallel to the dislocation 
should be observed. This is consistent with the experimental measure- 
ments on the dependence of the conductivity on the direction of the 
electric field. 

The model for N-type germanium that has been presented does not 
lead to an anisotropy in τ which sometimes accompanies the anisotropy
in D. Since the measured lifetime is typically 10 μsec, an injected hole
passes in and out of an inversion cylinder 10⁴ times before it recombines
with an electron. Clearly, then, the probability of recombination will be 
independent of the direction of diffusion relative to the dislocation array. 
The increased D measured parallel to the dislocation in the presence of a 
background light arises from a decrease in R due to the free carrier 
concentration in the inversion cylinder rising. The lifetime is limited by 
the rate at which electrons can surmount the potential barrier surrounding 
the dislocation sites. The photovoltage set up by the background light 
will assist the process and decrease the lifetime. 

So far the possibility of polygonization of the dislocations into 
dislocation ‘walls’ has not been considered. Extension of the calculation
of § 4 and § 5 to a polygonized crystal has been performed and an enhanced
value of D parallel to the walls can be obtained. However, the introduction
of new variable parameters (e.g. packing of dislocations in the wall,
dimensions of the wall) in this analysis makes it less stringent than that
which we have given.


APPENDIX 


We wish to calculate the hole conductance of a cylindrical PN junction.
If a small forward bias is applied to the junction, holes are injected into
the N-type material and recombine at the randomly distributed
recombination sites and at the dislocations parallel to the cylinder. Treating
the problem as though the recombination sites associated with the
dislocation were also randomly distributed we can write the continuity
equation in cylindrical coordinates as

    \frac{d^2p(r)}{dr^2} + \frac{1}{r}\frac{dp(r)}{dr} - \frac{p(r)}{L^2} = 0. \qquad (A.1)

Here L is the diffusion length of holes and p(r) the hole density in the
N region. Substituting x = r/L and introducing v(x) = p(r), eqn. (A.1)
becomes

    x^2\frac{d^2v(x)}{dx^2} + x\frac{dv(x)}{dx} - x^2v(x) = 0. \qquad (A.2)

The solution of this equation (Jeffreys and Jeffreys 1956) is

    v(x) = \alpha I_0(x) + \beta K_0(x),

where

    I_0(x) = \sum_{k=0}^{\infty} (\tfrac{1}{2}x)^{2k}(k!)^{-2},

    K_0(x) = -I_0(x)\ln(\tfrac{1}{2}x) + \sum_{k=0}^{\infty} (\tfrac{1}{2}x)^{2k}(k!)^{-2}F(k),

and F(k) is the digamma function.
Using the boundary conditions that v = 0 at x = ∞ and v = v_j at x = x_j = r_j/L
and making approximations valid because r_j/L ≪ 1, the constants α and β
are found,

    \alpha = 0 \quad\text{and}\quad \beta = v_j[\ln(2L/r_j) - \gamma]^{-1},

where γ = 0·58.
Therefore, for values of r such that r/L ≪ 1,

    p(r) = -v_j\left[\ln\frac{2L}{r_j} - \gamma\right]^{-1}\left(\ln\frac{r}{2L} + \gamma\right). \qquad (A.3)

The diffusion current per unit length of the cylindrical junction is

    I_p = -q \cdot 2\pi r_j D_p \left(\frac{dp}{dr}\right)_{r=r_j} = q \cdot 2\pi D_p \cdot p_n[\ln(2L/r_j) - \gamma]^{-1}, \qquad (A.4)

where D_p is the diffusion constant of holes. If a forward bias of V volts is
applied such that V ≪ kT/q, then following the usual PN junction theory,
we can write the conductance per unit length due to holes as

    G_p = 2\pi q \mu_p p_n\{\ln(2L/r_j) - \gamma\}^{-1}. \qquad (A.5)

Here p_n is the thermal equilibrium concentration of free holes in the
N region and μ_p the hole mobility.
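The small-argument approximation behind (A.3) to (A.5), K_0(x) ≈ ln(2/x) − γ, can be compared with the series given above. The sketch below is illustrative only: it assumes the digamma term F(k) denotes ψ(k + 1) = −γ + 1 + 1/2 + … + 1/k, which is the usual convention for this series, and it takes x_j = r_j/L from the values r_j = 0·8 × 10⁻⁴ cm and L = 2 × 10⁻² cm quoted in § 5.3. The function names are hypothetical.

```python
import math

EULER_GAMMA = 0.5772156649015329


def bessel_i0(x, terms=30):
    """Modified Bessel function I_0 from its power series."""
    return sum((0.5 * x) ** (2 * k) / math.factorial(k) ** 2 for k in range(terms))


def bessel_k0(x, terms=30):
    """Modified Bessel function K_0 from the series quoted in the Appendix.

    Assumes F(k) = psi(k + 1) = -gamma + H_k, with H_k the k-th harmonic number.
    """
    total = -math.log(0.5 * x) * bessel_i0(x, terms)
    harmonic = 0.0
    for k in range(terms):
        if k > 0:
            harmonic += 1.0 / k
        digamma = -EULER_GAMMA + harmonic
        total += (0.5 * x) ** (2 * k) / math.factorial(k) ** 2 * digamma
    return total


def k0_small_argument(x):
    """Leading term of K_0 for x << 1, as used in (A.3)."""
    return math.log(2.0 / x) - EULER_GAMMA


x_j = 0.8e-4 / 2e-2      # r_j / L from the values quoted in the text
exact = bessel_k0(x_j)
approx = k0_small_argument(x_j)

print(exact, approx, abs(exact - approx) / exact)
```

At x_j = 4 × 10⁻³ the two expressions differ by only a few parts in a million, so the logarithmic form used for the conductance is well justified for L ≫ r_j.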

In this approach we have treated the recombination sites associated
with all dislocations as if they were randomly distributed. Consider the
parallel dislocations to be arranged on a square network, then the eight
nearest neighbour parallel dislocations, at about a radius, R, of (2ρ_d)^{-1/2},
may dictate the conductance by forming a ‘recombination surface’
around the dislocation. Under these conditions eqn. (A.1) is modified
by replacing L by L_0 (the diffusion length in the absence of the parallel
dislocation array) and by changing the boundary condition v = 0 at
x = ∞ to

su(“)=——. at iets 
Ly “dz Lc 
s is the recombination velocity of a surface containing eight parallel
dislocations in 2πR; its value can be estimated from Okada’s (1955) data.
The expression for the hole conductance per unit length is


é Rs 
(Oe aa 2779 b-pPn Ty ° 


Since G_p′ is nearly two orders of magnitude less than G_p calculated from
eqn. (A.5) the neighbouring parallel dislocations do not have a controlling 
influence on the conductance, a result we expect since otherwise L—f. 
We are justified, therefore, in using (A.5), except that this is a d.c.
conductivity; an a.c. conductivity is appropriate for our problem, but for
small bias they are approximately equal.


ACKNOWLEDGMENTS 


We wish to thank our colleagues for helpful discussions, particularly 
Mr. A. C. Prior. The paper is published by permission of the Controller, 
H.M. Stationery Office. 


REFERENCES 


Arthur, J. B., Gibson, A. F., Granville, J. W., and Paige, E. G. S., 1958,
Phil. Mag., 3, 940.
Avery, D. G., and Gunn, J. B., 1955, Proc. phys. Soc. Lond. B, 68, 918.
Bell, R. L., and Hogarth, C. A., 1957, J. Electron. and Control, 3, 455.
Hogarth, C. A., and Baynham, A. C., 1958, Proc. phys. Soc. Lond., 71, 647.
Jeffreys, H., and Jeffreys, B. S., 1956, Methods of Mathematical Physics, 3rd ed.
(Cambridge: University Press).
Okada, J., 1955, J. phys. Soc. Japan, 10, 1110.
Pearson, G. L., Read, W. T., and Morin, F. J., 1954, Phys. Rev., 93, 666.
Read, W. T., 1954, Phil. Mag., 45, 775.
Shockley, W., 1953, Phys. Rev., 91, 228.
Tweet, A. G., 1955, Phys. Rev., 99, 1182.
Wertheim, G. K., and Pearson, G. L., 1957, Phys. Rev., 107, 694.




Cosmic Rays in the Earth’s Magnetic Field†


By P. Rothwell
Physics Department, Imperial College, London 


[Received May 16, 1958] 


ABSTRACT 


It is shown that the values of cosmic ray cut-off momenta in the earth's 
magnetic field, observed at many different places, are generally close to the 
values calculated from Störmer’s theory for the motion of charged particles in
a dipole field, if the usual centre dipole of the earth is replaced in the Störmer
equation by a dipole whose magnitude and direction are determined by the 
surface field at the place considered. An empirical expression for the actual 
cut-off momenta (in terms of the centre-dipole field, and the ‘ surface field ’ 
cut-off momenta) is deduced from the variation in sea-level nucleon intensity 
between London and Cape Town, and gives good agreement with experi- 
mental results over a wide range of latitudes and longitudes. It is concluded
that discrepancies between centre dipole predictions and experimental 
observations of cosmic ray intensities and cut-off momenta are due to 
differences between the earth’s real field and the dipole approximation to it, 
rather than to distortion of the earth’s outer magnetic field by ionized inter- 
planetary matter. 


THE cut-off momenta and the intensities of cosmic rays measured at 
various latitudes and longitudes are often not in good agreement with the 
prediction of Störmer theory for the motion of charged particles in the
earth’s dipole field. These discrepancies have been ascribed to distortion 
of the earth’s outer magnetic field by highly ionized interplanetary matter 
(Simpson et al. 1956). On the other hand, recent measurements (Rothwell 
and Quenby 1957) have shown that cosmic ray intensity and surface 
magnetic field anomalies occur near the same places. This result implies 
that ‘anomalies’ in the earth’s real field modify the cut-off momenta of 
cosmic ray particles (of which the intensity is a function). A calculation 
of the effect on the Störmer cut-off momentum of a given regional magnetic
anomaly, would be of great mathematical complexity, so we have used a 
phenomenological approach to the problem. 

† Communicated by the Author.

Störmer’s theory for the motion of charged particles in a dipole field
gives the minimum momentum p_m, required for an incoming vertical
particle of charge Ze, to reach the earth, as

    p_m = \frac{Ze}{c}\,\frac{M}{4R^2}\cos^4\lambda, \qquad (1)

where λ is the geomagnetic latitude, M is the moment of the earth’s dipole,
and R is the radius of the earth (6·4 × 10⁸ cm).
This may be conveniently expressed in terms of the horizontal field,
H_m = MR⁻³ cos λ and the magnetic dip δ_m, given by tan δ_m = 2 tan λ, as

    p_m = \frac{Ze}{4c}\,\frac{H_m R}{(1+\frac{1}{4}\tan^2\delta_m)^{3/2}}. \qquad (2)

We define p_s as the momentum cut-off corresponding to the surface
field values of the horizontal component and dip, H_s and δ_s, in eqn. (2),
that is

    p_s = \frac{Ze}{4c}\,\frac{H_s R}{(1+\frac{1}{4}\tan^2\delta_s)^{3/2}}. \qquad (3)


In this expression, the usual centred dipole, as determined from 
measurements of the earth’s field over the whole of the earth’s surface, has 
been replaced, effectively, by a dipole whose magnitude and direction are 
determined only by the surface field at the place considered. The path 
of a cosmic ray particle in the earth’s field is, of course, determined not 
only by the field at the earth’s surface, but also by the field further out, 
which in general approaches the dipole field approximation in value with 
increasing distance from the earth. The actual vertical cut-off momentum 
$p_c$ may therefore be expected to lie somewhere between $p_s$ and $p_D$. 
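
As a numerical illustration of eqns. (1) to (3), the following Python sketch evaluates the cut-off as a rigidity pc/Ze in GV. It is not part of the original analysis: the dipole moment of 8.1 × 10^25 gauss cm³ is an assumed round value (the text quotes only R), the factor of 299.79 V per gauss cm is the usual Gaussian-unit conversion, and all function names are hypothetical.

```python
import math

R_EARTH_CM = 6.4e8            # earth radius quoted in the text, in cm
M_DIPOLE = 8.1e25             # ASSUMED dipole moment, gauss cm^3 (not given in the text)
GV_PER_GAUSS_CM = 299.79e-9   # rigidity pc/Ze in GV per gauss cm of (field x length)


def dipole_cutoff_gv(lat_deg):
    """Eqn. (1) as a rigidity p_D c / Ze: (M / 4 R^2) cos^4(lambda)."""
    cos_lat = math.cos(math.radians(lat_deg))
    return GV_PER_GAUSS_CM * M_DIPOLE * cos_lat ** 4 / (4.0 * R_EARTH_CM ** 2)


def field_cutoff_gv(h_gauss, dip_deg):
    """Eqns. (2) and (3): cut-off rigidity from a horizontal field H and a dip angle."""
    tan_dip = math.tan(math.radians(dip_deg))
    bracket = (1.0 + 0.25 * tan_dip ** 2) ** 1.5
    return GV_PER_GAUSS_CM * h_gauss * R_EARTH_CM / (4.0 * bracket)


def dipole_surface_field(lat_deg):
    """Horizontal field (gauss) and dip (degrees) of the centred dipole at latitude lambda."""
    lat = math.radians(lat_deg)
    h_gauss = M_DIPOLE * math.cos(lat) / R_EARTH_CM ** 3
    dip_deg = math.degrees(math.atan(2.0 * math.tan(lat)))
    return h_gauss, dip_deg
```

Feeding the dipole values of H and dip into `field_cutoff_gv` reproduces `dipole_cutoff_gv`, which is the identity between eqns. (1) and (2); supplying observed surface values of H and dip instead would give the surface-field cut-off of eqn. (3).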

The cut-off momenta of α-particles have been measured directly at a 
number of places over North America, Europe and Australia. Table 1 
compares the observed values of cut-off rigidity $p/Z$ with values computed 
(a) for the dipole field, $p_D/Z$, and (b) for the surface field, $p_s/Z$. It can be 
seen that while discrepancies between the observed and the centre dipole 
values are large in some places, remarkably good agreement is found 
nearly everywhere between the observed and the ‘surface field’ values of 
cut-off momentum. 

Direct measurements of cut-off momenta have not been made at latitudes 
lower than ~40°; in this region the experimental data obtained with a 
neutron intensity monitor on a sea voyage between London and Cape 
Town (Rothwell and Quenby 1957) have been used to find an empirical 
expression for the cut-off momentum $p_c$ in terms of $p_s$ and $p_D$, on the 
principle that similar intensities should be observed at places with the 
same cut-off momentum. 

Figure 1 shows the variation of cosmic ray intensity between London 
and Cape Town (a) with $p_D$ and (b) with $p_s$. In (a) intensities in the 
southern hemisphere are higher than in the northern hemisphere, while 
in (b) they are lower. 

The condition that similar intensities should be observed at places 
with the same true cut-off is satisfied if we write 


$$p_c = x\,p_s + (1-x)\,p_D\,(p_D/p_s), \tag{4}$$

where $x$ is a factor which, if assumed constant over the range of 
measurements, can be determined by equating values of $p_c$ at places with 
similar observed cosmic ray intensities. Table 2 gives values of $p_D$, $p_s$ and 
$p_D/p_s$ for seven pairs of points at which similar intensities were recorded 
on the voyage of the ‘Roxburgh Castle’ between Cape Town and London, 
and the corresponding value of $x$ calculated for each pair. It can be 




Table 1. Comparison of Observed and Calculated α-Particle Cut-off Rigidities 


Fig. 1. Cosmic ray intensity versus cut-off rigidity on voyage of ‘Roxburgh Castle’. 
Curves are shown separately for the N. hemisphere and the S. hemisphere. 


Table 2. Comparison of calculated cut-off momenta $p_s$, $p_D$ for cosmic ray 
particles, at places with similar observed intensities between Cape Town 
and London, and deduction of factor ‘x’ in the empirical expression for 
the actual cut-off momenta 

$$p_c = x\,p_s + (1-x)\,p_D\,(p_D/p_s)$$


Fig. 2. Routes of ships which have recorded cosmic ray neutron intensities. 




seen that when $p_c$ is expressed in the form given by eqn. (4), the value of 
$x$ is, in fact, approximately constant for all latitudes given in the table. 
Taking $x \approx 0.9$, eqn. (4) may be rewritten as 

$$p_c = 0.9\,p_s + 0.1\,(p_D^{2}/p_s). \tag{5}$$

Figure 1 (c) shows the variation of cosmic ray intensities with $p_c$; it 
can be seen that in this case the intensities measured in the northern and 
southern hemispheres lie on the same curve. 
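
The determination of x from a pair of places of equal intensity, and the use of eqn. (5), can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names are hypothetical, and the expression for x is obtained simply by setting the two values of the cut-off from eqn. (4) equal and solving the resulting linear equation; it is not claimed to be the author's own procedure.

```python
def empirical_cutoff(p_s, p_d, x=0.9):
    """Eqn. (4): p_c = x p_s + (1 - x) p_D (p_D / p_s); x = 0.9 gives eqn. (5)."""
    return x * p_s + (1.0 - x) * p_d * (p_d / p_s)


def solve_x(p_s1, p_d1, p_s2, p_d2):
    """Value of x for which eqn. (4) gives the same cut-off at two places.

    Writing q = p_D^2 / p_s, equal cut-offs require
    x p_s1 + (1 - x) q1 = x p_s2 + (1 - x) q2, which is linear in x.
    """
    q1 = p_d1 ** 2 / p_s1
    q2 = p_d2 ** 2 / p_s2
    denom = (p_s1 - p_s2) + (q2 - q1)
    if denom == 0.0:
        raise ValueError("this pair of places does not constrain x")
    return (q2 - q1) / denom
```

Where the surface and dipole cut-offs coincide, `empirical_cutoff` returns that common value for any x, as eqn. (4) requires.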

The validity of eqn. (5) may be checked at many different latitudes and 
longitudes. 

(1) Neutron intensities have been recorded on voyages around the east 
and west coasts of America, and across the Pacific and Indian Oceans 
(Simpson et al. 1956, Rothwell and Quenby 1957, Kodama and Miyazaki 
1957) (see fig. 2). The variations of intensity with cut-off momentum 
calculated (a) for the dipole field, (b) for the surface field and (c) from 
eqn. (5) are shown in fig. 3. It can be seen that, while there are wide 
variations of intensity with centre dipole cut-off momentum $p_D$, similar 
intensities are observed at places with the same cut-off momentum $p_c$ over 
a wide range of latitudes and longitudes, except perhaps in the region of 
anomalously high horizontal magnetic intensity, crossed by the Soya, 
south of Japan (Kodama and Miyazaki 1957). 


Fig. 3. Cosmic ray intensity versus cut-off rigidity on voyages of ‘Roxburgh Castle’, 
‘Atka’, ‘Labrador’ and ‘Soya’. 
Legend: N. hemisphere and S. hemisphere (‘Roxburgh’); N. hemisphere and 
S. hemisphere, southbound and northbound (‘Atka’); E. America and 
W. America (‘Labrador’); N. hemisphere and S. hemisphere (‘Soya’). 
Intensities on ‘Roxburgh’ and ‘Soya’ have been normalized to the same 
value at Cape Town. Intensities on ‘Atka’, ‘Labrador’ and ‘Soya’ ... 



(2) It has already been shown that the measured positions of minimum 
cosmic ray intensity lie, to a first approximation, on the dip equator 
(Rothwell and Quenby 1957). There are some discrepancies, because in 
the earth’s real field, unlike the dipole field, the positions of the maximum 
horizontal intensity and zero dip do not always coincide. The largest 
discrepancies between the measured position of minimum cosmic ray 
intensity and the dip equator occur in the equatorial region of South 
America, where the greatest distances between the positions of maximum 
H and zero dip occur. Figure 4 shows the experimentally determined 
position of minimum intensity reported by Simpson (1957) and others, 
together with (a) the geomagnetic equator; (b) the dip equator; and 
(c) the line of minimum cosmic ray intensity computed from eqn. (5) with 
$x \approx 0.9$: the experimental points all lie very near (c). 
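
One way in which a point on line (c) might be located along a given meridian is sketched below in Python. It rests on the assumption, implicit in the text, that the intensity is least where the cut-off of eqn. (5) is greatest. The function name is hypothetical and the numbers are invented for illustration only; they are not measured values.

```python
def cosmic_ray_equator(latitudes, p_s, p_d, x=0.9):
    """Latitude at which the eqn. (5) cut-off is largest along one meridian."""
    best_lat, best_pc = None, float("-inf")
    for lat, ps, pd in zip(latitudes, p_s, p_d):
        pc = x * ps + (1.0 - x) * pd * pd / ps
        if pc > best_pc:
            best_lat, best_pc = lat, pc
    return best_lat, best_pc


# HYPOTHETICAL cut-off rigidities (GV) sampled along one meridian.
lats = [-10, -5, 0, 5, 10]
surface = [13.5, 14.8, 14.5, 13.6, 12.5]   # p_s: maximum displaced to -5 degrees
dipole = [14.0, 14.6, 14.9, 14.6, 14.0]    # p_D: maximum on the geomagnetic equator
equator_lat, equator_pc = cosmic_ray_equator(lats, surface, dipole)
```

With x close to unity the surface-field term dominates, so in this invented example the computed position follows the maximum of the surface-field cut-off rather than the geomagnetic equator.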


Fig. 4. Comparison of experimentally determined positions of minimum cosmic ray 
intensity with (a) geomagnetic equator, (b) dip equator, and (c) cosmic 
ray equator, determined from eqn. (5). 
Points: sea level neutron intensity data and high altitude neutron intensity 
data. Axes: geographic latitude against geographic longitude. 


(3) Table 1, column (c) gives the values of α-particle cut-off rigidities 
calculated from eqn. (5); there is satisfactory agreement with the observed 
values (although cut-off momenta calculated from the surface field only, 
$p_s$, give slightly better agreement with experimental results in several 
cases). 




The expression (5) for the cut-off momenta of cosmic ray particles in 
the earth’s field does, therefore, give satisfactory agreement with experi- 
mental results over a wide range of latitudes and longitudes. Hence we 
conclude that, except in those regions where the earth’s real field differs 
very much from the dipole field, the values of the actual cut-off momenta 
are close to those obtained by replacing the conventional centre dipole 
term in the Störmer eqn. (1) by a dipole whose magnitude and direction 
are determined by the surface field components at the point considered. 

Now it is known that the main part of the deflection of a cosmic ray 
particle occurs rather near the earth’s surface, at distances of the order of 
hundreds or at most a few thousand kilometres. Comparison of the dip 
and the horizontal component of the earth’s real field at various distances 
from the earth’s surface and at various latitudes and longitudes (using 
data compiled by Vestine et al. (1947) from the 1945 analysis of the earth’s 
field) with the values of dip and H calculated (a) for the dipoles deduced 
from the surface field components and (b) for the conventional centre 
dipole field, shows that the earth’s real field, up to ~ 1000 km above any 
particular point, is, in fact, better represented by a dipole whose magnitude 
and direction are deduced from the surface field at that point than by the 
centre dipole deduced from measurements of the earth’s field over the 
whole of the earth’s surface. 

It is concluded that discrepancies between simple dipole predictions 
and experimental observations of cosmic ray intensities and cut-off 
momenta are due to differences between the earth’s real field and the dipole 
approximation to it, rather than to distortion of the earth’s outer magnetic 
field by ionized interplanetary matter. 


ACKNOWLEDGMENTS 

The author would like to thank Professor P. M. S. Blackett for helpful 
advice and criticism, and Dr. W. Webber for his information on 
measurements of α-particle cut-off momenta. 


REFERENCES 

Aly, H. H., and Waddington, C. J., 1957, Nuovo Cim., 5, 1679. 
De Marco, A., Milone, A., and Reinharz, M., 1956, Nuovo Cim., 3, 1150. 
Fowler, P. H., and Waddington, C. J., 1956, Phil. Mag., 1, 637. 
Fowler, P. H., Waddington, C. J., Freier, P. S., Naugle, J., and Ney, E. P., 1957, Phil. Mag., 2, 157. 
Hopper, V. D., Laby, J. E., and Lim, Y. K., 1958, Aust. J. Phys. (to be published). 
Kodama, M., and Miyazaki, Y., 1957, Report of Ionosphere Research in Japan. 
McDonald, F. B., 1957, Phys. Rev., 107, 1386. 
Rothwell, P., and Quenby, J., 1957, Report at Varenna Conference (to be published in Nuovo Cim.). 
Simpson, J. A., 1957, Report at Varenna Conference, June. 
Simpson, J. A., Fenton, K. B., and Rose, D. C., 1956, Phys. Rev., 102, 1648. 
Vestine, E. H., Laporte, L., Lange, I., Cooper, C., and Hendrix, W. C., 1947, Carnegie Institute of Washington Publications 578, 580. 
Waddington, C. J., 1956, Nuovo Cim., 8, 930. 




Deformation of Thin Films on Solid Substrates† 


By D. R. Brame and T. Evans 


Tube Investments Research Laboratories, Hinxton Hall, Cambridge 


[Received May 20, 1958] 


ABSTRACT 

Thin films of various face-centred cubic metals in the thickness range of 
300 to 700 Å have been oriented on single crystals of silver and palladium. 
These specimens have then been deformed and the way in which the film 
accommodates the imposed strain has been determined by examination of 
the film in the electron microscope after stripping from the substrate. It is 
considered that the mode of deformation is determined by the ease with which 
dislocations can be injected into and through the film from the underlying 
substrate and factors which influence this transfer have been examined. 


§ 1. INTRODUCTION 

INTEREST was first aroused in the effects of thin films on the mechanical 
properties of single crystals by the discovery by Roscoe in 1934 that an 
oxide film, less than 20 atoms in thickness, increased the critical shear 
stress for slip in cadmium crystals by about 5%. Since that time many 
workers have confirmed the Roscoe effect with different surface films and 
substrates. Most of the work has been done with hexagonal close packed 
crystals such as zinc and cadmium in which there is only one active slip 
plane (see for instance, Menter and Hall 1950). Andrade and Randall 
(1948, 1952) found that the presence of hydroxide films on cadmium single 
crystals completely stopped creep and that twice the stress was required 
to restart the creep. More recently, Lipsett and King (1957) sputtered 
thin gold films on cadmium single crystals and the stress at which plastic 
deformation started was increased by about 6 g wt/mm². The increase 
was found to be independent of film thickness for films in the thickness 
range of 1500-24000 Å. The effect with zinc single crystals has been 
demonstrated using oxide films (Harper and Cottrell 1950), 
electrodeposited copper (Pickus and Parker 1951, and Gilman 1951), 
electrodeposited nickel, gold, zinc and silver, and vapour deposited copper 
and silver (Gilman and Read 1952). With face-centred cubic crystals, 
very thin oxide films have been shown to increase the critical resolved 
shear stress of silver single crystals (Andrade and Henderson 1951). 

The increase in strength cannot be explained by adding the strength of 
the thin film to that of the underlying crystal as this would make the 
strength of the film impossibly large. This implies that the presence of 
the thin film is modifying the mechanical behaviour of the crystal and 
three interpretations have been given (Gilman 1955). (1) There is alloying 
at the interface which strengthens the crystal. (2) The presence of the 


† Communicated by J. W. Menter. 


surface film inhibits the operation of surface dislocation sources of the 
Frank-Read type. (3) The surface film prevents exit of dislocations from 
the surface of the crystal, i.e. slip is inhibited. 

Gilman (1955) has investigated the three possibilities and could not 
detect any significant alloying by spectrographic analyses with crystals 
which had been strengthened by plating a thin film of copper on their 
surfaces. He concluded that it is doubtful whether alloying has an important 
effect on the strengthening of the crystals. He tried to differentiate between 
the second and third possibilities by a comparison of reflected x-rays from 
the surfaces of deformed plated zinc crystals and deformed clean crystals. 
He found that the presence of a thin copper film caused more distortion of 
the surface layers after deformation than for a clean crystal. This supports 
the hypothesis that the strengthening is due to the film inhibiting the 
passage of dislocations out of the surface of the crystal since if it inhibited 
the operation of surface dislocation sources, the distortion would be less 
for the plated crystal. This explanation is consistent with the work of 
Barrett (1953) who twisted polycrystalline zinc and steel specimens which 
had oxide films on them. On removing the oxide layers, the wires twisted 
a small amount in the original twisting direction. He interpreted this 
increment in twist as due to the escape from the surface of the dislocations 
which had piled up under the oxide film as a result of the previous 
deformation. Measurements of the strengthening of crystals by surface 
films have been almost exclusively confined to single crystals as the grain 
boundary strengthening effect in polycrystals is very large and almost 
completely swamps any strengthening due to surface films. 

As a general conclusion from this earlier work it may be stated that the 
strengthening effect of a thin surface film is associated with its behaviour 
as a barrier preventing the exit of dislocations from the substrate. 
Although not explicitly stated, it can be assumed from the conditions of 
preparation that the films used in previous experiments have been poly- 
crystalline. For the fundamental understanding of the inhibition mech- 
anism, such films are not particularly favourable since there is no control 
over grain size and relative orientation of the film and substrate. Both 
these factors are likely to affect the effectiveness of the film as a 
barrier to the exit of dislocations from the substrate. The new experi- 
ments described below have been carried out mainly on thin single crystal 
films prepared by vacuum evaporation on to single crystal substrates so 
that the film and substrate lattices are in parallel orientation. Under these 
circumstances the misfit at the boundary between the two lattices can be 
accommodated by a two dimensional network of dislocations with a 
spacing determined by the degree of misfit. To examine the parameters 
which determine the ease of propagation of dislocations through the 
boundary, a number of face-centred cubic films were oriented on silver 
single crystals, deformed in tension and the films examined by trans- 
mission in the electron microscope after stripping from the silver sub- 
strates. The transmission diffraction patterns show the structure of the 




films whilst such details as slip, cracking and dislocation distribution can 
be obtained by transmission microscopy giving an indication of the way 
in which the films have accommodated the applied strain when attached 
to the deformed substrates. Subsidiary experiments were also carried 
out with polycrystalline films on single crystal silver substrates, oriented 
films on polycrystalline palladium substrates, and also single crystal films 
attached to non-crystallographic substrates to determine the types of 
deformation which the films undergo under various conditions. 
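
The dependence of the spacing of the interfacial dislocation network on the degree of misfit, mentioned above, can be illustrated by a short Python sketch. It assumes the simple estimate that the spacing is about b/f for a Burgers vector b and fractional misfit f; this relation, the value b = 2.89 Å taken for silver (a/√2 with a ≈ 4.09 Å) and the function name are assumptions introduced here and are not given in the paper. The misfit values are those quoted in § 3.

```python
def misfit_dislocation_spacing(burgers_angstrom, misfit_fraction):
    """Approximate spacing b / f (in angstroms) of misfit dislocations."""
    if misfit_fraction <= 0.0:
        raise ValueError("misfit must be positive")
    return burgers_angstrom / misfit_fraction


B_SILVER = 2.89  # ASSUMED Burgers vector a/sqrt(2) for silver, in angstroms

# Fractional misfits with silver as quoted in the text; the gold value is an
# upper bound, so its spacing is a lower bound.
MISFITS = {"Au": 0.002, "Au-Pd": 0.019, "Pt": 0.041}
SPACINGS = {film: misfit_dislocation_spacing(B_SILVER, f) for film, f in MISFITS.items()}
```

On this estimate the network becomes finer as the misfit increases, which is the trend that the experiments on different films are designed to examine.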


§ 2. EXPERIMENTAL DETAILS 


Single crystals of silver were produced by a soft mould technique. 
Tensile specimens of silver with dimensions shown in fig. 1 were cut from 
a sheet 0-75 mm in thickness. A specimen was packed with alumina in a 
graphite mould which was passed through a furnace to produce the shaped 
single crystal. On removal from the mould, the specimen was lightly 
brushed with a soft brush to remove the alumina and then differentially 
etched for 30 seconds in the following solution to show the grain structure : 


1 pt. of 10 vol. H₂O₂ to 2 pts. of 0.88 NH₄OH. 


Fig. 1. Dimensions of the tensile specimens (dimension labels: 35 mm, 75 mm). 


A high yield of crystals was obtained which were single crystal along their 
gauge lengths and the orientations of these were determined from x-ray 
back reflection Laue photographs. The specimen was chemically 
polished by dipping alternately in solutions A and B (Pinner 1953) and 
then thoroughly washed in distilled water. 


A: 3 pts. of NaCN (37.5 g/litre), 2 pts. of 10 vol. H₂O₂; 
B: NaCN (37.5 g/litre). 

The crystal was placed on a small furnace in a vacuum evaporation plant 
and the residual surface contamination removed by ionic bombardment 
at 4 kV with a current density of approximately 60 microamps/cm². The 
method used for the deposition of oriented gold films was an extension of 
that developed by Pashley (1958) for the production of thin gold films on 
silver-mica substrates. The silver specimen was maintained at a 
temperature of 270°C and the gold evaporated from a tungsten filament. The 
amount of gold evaporated was calculated to correspond to a film 
thickness of 500 Å, and it was found in practice that the thickness varied in the 
range of 300-700 Å. This method can be used for the production of thin 
single crystal films of arbitrary orientations since epitaxial growth occurs 




for all surface orientations of the silver substrate. This is useful in such 
an investigation as silver substrates can be chosen of such orientations that 
the {111} slip planes in the oriented gold films have large projected areas 
when viewed by transmission in the electron microscope. The deposition 
of oriented gold-palladium alloy, platinum and rhodium films of the same 
nominal thickness as the gold films was done by evaporating from a coiled 
coil of tungsten with silver substrate temperatures of 250, 350 and 400°C 
respectively. Deposition was also carried out with the silver substrates 
at room temperature and oriented single crystals of gold and polycrystalline 
films of gold-palladium alloy, platinum and rhodium were produced by 
this method. All specimens were deformed by about 15% elongation and 
the surface film removed by dissolving the substrate in 35% nitric acid. 
After washing, the film was mounted on a copper grid, dried, and examined 
by transmission in the Siemens electron microscope (Elmiskop I operating 
at 80 kV). 


§ 3. RESULTS 
3.1. Single Crystal Films 


3.1.1. Crystallographic substrates 
Gold films on silver (lattice misfit with silver <0.2%). Examination of 
an oriented single crystal film after stripping from the silver substrate 
shows that the applied strain is accommodated by a thinning of the film 
due to slipping on the active (111) planes. The thinning which occurs in a 
gold film due to slipping is shown diagrammatically in fig. 2. The slipped 
regions in the electron micrograph appear as light bands, indicating a 
higher transmission than regions on either side. The gold films remain 
coherent and no cracking is observed even after considerable straining 
(approximately 30% elongation). Figure 3, Pl. 60 shows a gold film 
which exhibits slipping in two directions due to the substrate orientation 
being favourably positioned for duplex slip. The contour type of contrast 
is associated with the buckling of the film and is due to the high intensity 
of the diffraction when the planes of the film are oriented near the Bragg 
angle for the incident electrons. When the specimens are deformed in 
compression instead of tension the slip lines have a reversed contrast, 
showing that the slip is in the opposite direction, producing thickening at 
the slipped regions. Dislocations can be detected in the film by the 
contrast associated with the stacking fault between the separated partial 
dislocations (Whelan and Hirsch 1957) which occur in face-centred cubic 
metals. Figure 4, Pl. 60 shows such a distribution of dislocations and these 
are considered as dislocations which have been injected into the film from 
the substrate during deformation and have been retained in the film after 
stripping. 

Fig. 2. Thinning of a gold film by slip (labels: Au film; thickness variation). 

The behaviour of the oriented gold film when strained on a silver sub- 
strate implies that enough dislocations are injected into the film for the 
applied strain to be accommodated completely in a ductile manner by the 
formation of slipped regions. If this were not so, the film would crack 
and this has not been found to occur with gold films. Examination of gold 
films which have been stripped from the silver substrate without previous 
deformation shows a complete absence of slipped regions but with a 
random distribution of dislocations which have been grown into the film. 
It is not considered that these dislocations are responsible for the slipped 
regions observed in deformed films since the slip occurs on discrete planes 
and a general slipping would be expected if in-grown dislocations were 
responsible. Optical observation of the deformed silver specimen shows 
no obvious difference between the slip lines on the coated surface and the 
opposite uncoated surface. This favours the hypothesis that the dis- 
locations responsible for the slipping in the film have been provided by the 
underlying silver. 

In the experiments described here, the substrate temperature during 
the evaporation of the gold was 270°C but oriented films deposited at room 
temperature show identical effects after deformation on a silver sub- 
strate. 

Gold-palladium alloy films with a nominal composition of 40% Au-60% 
Pd on silver (lattice misfit with silver = 1.9%). Examination of the 
Au/Pd film after deformation and stripping shows as before that the strain 
is accommodated in the film by slipping on {111} planes with no evidence 
for cracking. Figure 5, Pl. 60 shows a Au/Pd film with thinning due to 
slipping on one set of {111} planes and it is again considered that enough 
dislocations can be injected into the film to enable it to deform in a ductile 
manner. 

Platinum films on silver (lattice misfit with silver = 4.1%). With 
oriented films of platinum on substrates of silver slip is again observed 




but, in addition, cracks sometimes appear in the general direction of the slip 
lines. Figure 6, Pl. 61 shows such a crack with the slip lines on either side 
of it. In this case it appears that the strain is only partly accommodated 
by slip due to the injection of dislocations and failure presumably occurs 
because the number injected is too small for the film to deform entirely 
by slipping. The cracking is envisaged as a brittle failure along a thinned 
region where the stress is high due to the distortion produced by the piling 
up of dislocations in the active slip planes of the underlying silver. Figure 7, 
Pl. 61 shows a platinum film which accommodated the imposed strain 
entirely by slipping on two slip systems. It is possible that when duplex 
slip sets in at an early stage in deformation, the sharing of slip on more 
than one set of slip planes makes the passage of dislocations from the sub- 
strate into the film more favourable than when slip on a single set of 
planes occurs extensively. This would then make the passage of dis- 
locations dependent upon the orientation of the system. 

Rhodium films on silver (lattice misfit with silver = 7.37%). With 
rhodium films on silver substrates, extensive cracking occurs with a 
slight amount of slipping. Figure 8, Pl. 61 shows such an area with a 
slight amount of slipping and fig. 9, Pl. 61 illustrates cracking without the 
presence of slip. As with platinum, it is considered that not enough dis- 
locations can be injected into the rhodium film to accommodate the applied 
strain and cracking occurs along the highly stressed regions above the 
active slip planes in the silver substrate. 

Barrier effect of rhodium films. The experiment described above suggests 
that it is difficult for dislocations to pass from a silver substrate into and 
through a rhodium film. In this sense, the interface may be considered 
as a barrier to the passage of dislocations and a multilayer of thin films has 
been grown on a bulk silver substrate to test this. Thin oriented layers 
of rhodium, silver and gold were successively evaporated on to a bulk 
silver single crystal substrate producing a specimen as shown in fig. 10. 
This was then deformed in tension and the gold film detached by dissolving 
away the silver. Examination of the gold film showed that it had cracked 
in an apparently brittle fashion, as shown in fig. 11, Pl. 62. It seems 
probable that the rhodium film is acting as a barrier to the passage of 
dislocations from the silver substrate into the multilayer, and the gold, 
being on the outside, accommodates the imposed strain by cracking because 
not enough dislocations can be injected into it to enable slip to take place. 

Fig. 10. Multilayer specimen; layers from the outer surface inwards: Au, Ag, Rh, Ag (bulk substrate). 

The barrier effect of the rhodium film-silver substrate interface was 
demonstrated in another way. An oriented single crystal film of rhodium 
was evaporated on one side of a single crystal tensile specimen of silver 
and an oriented film of gold evaporated on the other side, as shown in 
fig. 12. This specimen was deformed in tension and the gold and rhodium 
films examined subsequently. The rhodium film had cracked as before 
and the gold film showed a far higher density of injected dislocation 
groups than was noted when the rhodium film was absent, as shown in 
figs. 13 and 14, Pl. 62. This is attributed to the fact that the rhodium 
film, in places where it is not cracked, acts as a strong barrier to the 
passage of dislocations which pile up against the rhodium film boundary. 
This produces a strong back stress which stops active sources from 
operating. The gold-silver interface allows the passage of dislocations 
through it and thus a distorted configuration of dislocation loops is 
produced on the active slip planes as shown schematically in fig. 15. The 
rhodium effectively pins the loops in the specimen and the gold film can 
trap segments of the loops which have passed through the gold film but 
not transferred through the rhodium film. In addition, the hardening 
due to the presence of the rhodium film causes more dislocation sources 
to become operative in the silver substrate. This will occur in the 
substrate slip planes over which the rhodium film has not cracked, as cracking 
would enable the dislocations to escape easily from the silver substrate on 
the rhodium side. 

Fig. 12. Silver tensile specimen with a rhodium film on one side and a gold film on the other. 

Fig. 15. Schematic configuration of dislocation loops on an active slip plane (labels: Source; Rh; Slip plane). 

Gold films on palladium (lattice misfit=5%). Large grained poly- 
crystal specimens were used as substrates on account of the difficulties 
of making single crystals with the material available. Comparison with 
experiments using similar silver substrates indicates that provided grain 
boundary regions are ignored the behaviour of the film on any particular 
grain may be taken as typical of a single crystal in the corresponding 
orientation. 

It was found that the strain in an oriented gold film on a palladium sub- 
strate is accommodated entirely by slipping. 

Rhodium films on palladium (lattice misfit = 2.26%). Again the 
strain is accommodated almost entirely by slipping although occasional 
cracking does occur. The significance of these experiments on 
palladium substrates in relation to those on silver is discussed below. 


3.1.2. Non-crystallographic substrates 

The thinning of the gold films on silver substrates by slip during 
deformation suggests that the dislocations required for this mode of 
deformation arise from the active slip planes in the crystallographic sub- 
strate. To test this, a single crystal gold film was attached to a rubber 
balloon which served as a non-crystallographic substrate. Optical obser- 
vation of the gold film when the balloon was inflated showed that the 
film progressively cracked into smaller pieces. This brittle cracking is 
attributed to the absence of injected dislocations from the substrate to 
accommodate the imposed strain. Owing to the complex stressing 
conditions with balloons, similar experiments were carried out on a film 
attached to a tensile specimen of a polymer when deformation again 
produced a brittle type of cracking in the film with the crack direction 
normal to the applied stress. 


3.2. Polycrystalline Films 
By deposition with the substrate at room temperature, unoriented 
polycrystalline films of gold-palladium alloy, platinum and rhodium with 




a grain size of less than 100 A have been formed on single crystal substrates 
of silver. The film and substrate were again deformed in tension and 
the film examined in the electron microscope after stripping from the 
silver substrate. In these cases, the strain was accommodated in the film 
by cracking along the directions of the slip lines on the surface of the under- 
lying silver substrate. Figure 16, Pl. 62 shows a typical region of cracking 
in one direction. These films are considered to be the type which have 
been used in the published work on the strengthening effect of films on 
single crystals. 


§ 4. DISCUSSION 


It is first necessary to consider the origin of the dislocations which permit 
a surface film to deform by slip when it is oriented on a substrate which is 
deformed plastically. A number of alternatives are possible. 

(1) Deformation by virtue of the movement of the dislocations grown 
into the film. 

(2) Activation of dislocation sources in the film. 

(3) Deformation by virtue of dislocation sources operating in the bulk 
substrate which emit dislocations able to pass into and through the film. 

(1) The experiment with a Au/Ag/Rh multilayer on silver indicates that 
extensive plastic deformation of a gold film by the movement of dis- 
locations is not likely. This is consistent with observations by Pashley 
(1958) that unattached thin single crystal metal films of this thickness 
(~ 500 Å) are inherently brittle. Furthermore the visual evidence from
experiments where the film is ductile (e.g. Au on Ag) shows that the
deformation in the film occurs by extensive glide on single or closely spaced
slip planes. The grown-in dislocations distributed at random throughout
the film would not produce plastic deformation of this type even if they 
were able to move. 

(2) Such regions of extensive glide could be produced by the activation 
of sources within the film. Again the experiment with the Au/Ag/Rh 
multilayer on silver shows that this is extremely unlikely. There is the 
additional possibility that dislocation sources of the type observed by 
Whelan, Hirsch, Horne and Bollmann (1957) in stainless steel could 
operate. They found that dislocations can be nucleated at a comparatively 
low stress in a wedge shaped film such as might occur near a hole or an 
edge. Although there are sometimes holes in the films they are much too 
infrequent to account for the deformation, and slipped regions in the film 
are never observed to be specially associated with such holes as are present. 
A wedge shaped film may be formed at the edges of a specimen as shown 
at A and B in fig. 17. In order to eliminate this as a possible source of 
dislocations a silver specimen was coated all round with a gold film of uni- 
form thickness as shown in fig. 18. No difference was observed in the 
characteristics of the deformation of the film and it is concluded that 
possible wedges at the edges are not responsible for nucleating the 
dislocations which result in the ductile behaviour of the gold film. 


980 D. R. Brame and T. Evans on the 


(3) All the experimental evidence is consistent with the assumption 
that the behaviour of the film is determined by its reaction to dislocations 
arising from sources activated in the interior of the substrate. These 
produce dislocation loops on the active slip planes and the mechanical 
response of the film depends upon the ability of these dislocations to pass 
into and through the film. Factors which affect the ease of passage of 
dislocations into the film include : 

(a) alloying between the substrate and the film, 

(b) the structure of the film itself, 

(c) the difference in elastic moduli of the substrate and the thin film, 
and 

(d) the degree of misfit between the substrate and the film. 


Fig. 17. Au film on Ag substrate; A and B mark the edges of the specimen.

Fig. 18. Au film surrounding the Ag substrate.


(a) The extent to which alloying occurs during the growth of these films 
is at present uncertain and the problem is being investigated in the
laboratory. However, it has been found that oriented gold films on silver
substrates show the same type of deformation whether they have been
grown with the substrate at room temperature or at 270°C. This suggests
that in this system at least, the effects of interfacial alloying are small as 


presumably less alloying would occur during deposition with the substrate 
at room temperature. 

(b) A perfect oriented single crystal film has its {111} slip planes parallel
to those in the substrate and a dislocation can pass from the substrate slip 
plane into the film. Thus no misfit is left behind at the boundary during 
the transfer due to the slip plane in the film being misoriented with respect 
to the substrate slip plane. On the other hand with a polycrystalline film 
on a single crystal substrate it is very difficult for a dislocation to be trans- 
ferred because the slip planes in the grains of the film are randomly 
oriented with respect to one another and also to the active slip planes in 
the substrate. Thus a polycrystalline thin film on a single crystal sub- 
strate accommodates the imposed strain by cracking in the directions of 
the slip traces in the underlying silver. These are presumably where 
there are stress concentrations due to the distortion produced in the 
substrate by piled up dislocations below the film-substrate boundary. 


(c) Head (1953) has analysed the image forces arising when a screw 
dislocation in a substrate approaches a surface to which is attached a thin 
film of different elastic modulus. If the elastic modulus of the film is 
greater than that of the substrate, a dislocation approaching the surface 
experiences first an attraction up to a critical distance and thereafter a 
repulsion from the surface. The equilibrium distance of the dislocation 
from the boundary depends upon the relative elastic moduli and the 
film thickness. The critical distance varies from less than 50 Å in the
case of a 400 Å gold film on silver, through half the film thickness for platinum
on silver, to greater than the film thickness in the case of a rhodium film on
silver. Head’s analysis is concerned with a screw dislocation but he 
considers that the same conditions hold for an edge. The shear moduli of 
the metals used in these experiments are given in the table. 


Material               Degree of misfit      Shear modulus
                       with substrate        (million p.s.i.)

Silver substrate       -                     3.9
  Gold                 <0.2%                 4.1-5.6
  Gold/Palladium       1.9%                  -
  Palladium            5.0%                  6.7
  Platinum             4.1%                  8.5-10
  Rhodium              7.37%                 16-20
Palladium substrate    -                     6.7
  Gold                 5%                    4.1-5.6
  Rhodium              2.26%                 16-20


The modulus of the film is always greater than that of a silver substrate. 
We may therefore expect that in all these cases the Head effect will tend 
to inhibit the exit of dislocations from the substrate with a repulsion 
which increases as one progresses from gold to rhodium films. Thus, 


with silver substrates and the films which have been used, as the degree 
of misfit between the two lattices increases so does the repulsion due to 
the relative elastic moduli. To try to differentiate between the two 
effects, gold and rhodium films in turn were deposited on palladium sub- 
strates. In the gold case, the modulus of the film is less than that of the 
substrate which, according to Head’s analysis, will result in there being 
an attractive force on a dislocation approaching the surface. As the
strain in the film is entirely accommodated by slip, although the lattice
misfit is 5%, it is concluded that the ratio of the shear moduli has some
effect on the passage of dislocations from the substrate into the film. In
the case of rhodium on palladium there is a large difference in the moduli
with the film having the higher modulus. This time, slip again occurs in
the film, with very occasional cracking, at a lattice misfit of 2.26%, which
again indicates that the ratio of the shear moduli is having some effect.

(d) The main influence in determining the transfer of dislocations from 
the substrate to an oriented film is thought to be the difference in lattice 
parameter between them. Firstly, the barrier effect of the accommodating 
network of dislocations between them (van der Merwe 1950) must be 
considered. The dislocations of this network are mobile only in the 
boundary and their spacing depends upon the misfit between the two 
lattices and the orientation of the interface. On a (111) interface the
network has trigonal symmetry, whereas on (110) interfaces it is rectangular
and on (100) interfaces square. Gold, gold-palladium alloy,
platinum and rhodium have a lattice misfit with respect to silver of
<0.2%, 1.9%, 4.1% and 7.37% respectively, and the spacing of the
accommodating network on a (111) interface would vary from the order of
2000 Å in the case of gold to 32 Å in the case of rhodium. With a gold
film on a silver substrate, the hindrance to the passage of dislocations
through a network of 2000 Å spacing is expected to be small. For instance,
in the particularly simple case where the accommodating dislocations 
being cut have a Burgers vector parallel to the mobile dislocation as it 
reaches the boundary, transference is possible by the following mechanism. 
In fig. 19, (a) shows the mobile dislocation line in the silver approaching
the silver-gold boundary. At (b) the mobile dislocation is pinned at the
accommodating dislocations and the segments bow out between the pinning
positions. At (c) the segments have cut the free surface and move towards
one another, and A combines with B, C with D, and E with F. This
results in a slipped region in the gold film itself by the effective transference
of the mobile dislocation from the silver substrate through the gold film.
The stress required by such a cutting mechanism is extremely small and
is $\mu_1 b/2500$, where $\mu_1$ = shear modulus of gold and $b$ = Burgers vector of
the dislocation in the gold. This assumes a film thickness of 500 Å and
a network spacing of 2000 Å. This is a particularly simple case and does
not involve jog formation but it does serve to illustrate the weakness of
the accommodating network in the case of a gold film on a silver substrate.
In the general case, the Burgers vector of the dislocations being cut will 


not be parallel to the cutting dislocation and jog formation will occur. As
the misfit increases and the network spacing consequently decreases, the
stress required for a dislocation to cut through will increase. Cottrell (1953)
has pointed out that the stress required for a dislocation to cut through a
row of static dislocations is inversely proportional to the spacing between
the static dislocations, and it again appears that as films of gold,
gold-palladium, platinum and rhodium are used in turn on silver substrates it
will be progressively more difficult for the mobile dislocation to cut through
the interfacial network owing to the decreasing network spacing.
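
The small stress quoted for the gold-on-silver case, of the form (shear modulus × Burgers vector)/2500, can be checked with a short calculation. The sketch below is one way to arrive at a number of that form and is not taken from the paper: it assumes that each pinned segment bows into a circular arc whose chord is the network spacing and whose height is the film thickness, and that the line tension is μb²/2. The function names are illustrative only.

```python
def bow_out_radius(spacing, height):
    """Radius of a circular arc with chord `spacing` and sagitta `height`."""
    return (spacing ** 2 / 4.0 + height ** 2) / (2.0 * height)


def cutting_stress_over_mu_b(spacing, height):
    """Stress divided by (mu * b), assuming a line tension of mu*b**2/2.

    Balancing tau*b against (line tension)/R gives tau = mu*b / (2*R),
    so the returned value is 1 / (2*R) in reciprocal length units.
    """
    return 1.0 / (2.0 * bow_out_radius(spacing, height))


# Values quoted in the text for gold on silver, in angstroms.
spacing_A = 2000.0
thickness_A = 500.0

R = bow_out_radius(spacing_A, thickness_A)
factor = cutting_stress_over_mu_b(spacing_A, thickness_A)
print(f"radius of curvature = {R:.0f} A")
print(f"stress = mu*b / {1.0 / factor:.0f}  (b in A)")

# A closer network at the same film thickness needs a larger stress.
# (The arc picture only applies while height <= spacing / 2.)
print(cutting_stress_over_mu_b(1000.0, 500.0) / factor)
```

Under these assumptions the stress rises as the network spacing falls, which is the trend the text attributes to the increasing misfit from gold to rhodium.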


Fig. 19. Stages (a) to (c) of a dislocation line approaching the boundary and passing the accommodating dislocations.

The actual transfer of a dislocation from a single crystal silver sub- 
strate to an oriented film of a different material will now be considered as 
this may contribute a strong barrier to the passage of dislocations. When
a dislocation passes from a substrate with a particular lattice parameter 
into an oriented film with a different lattice parameter, the Burgers vector 


changes in crossing the boundary and a portion of the Burgers vector is
left behind at the interface. Consider a dislocation of Burgers vector $b_1$
in the substrate. On passing into the film the Burgers vector changes to
$b_2$, and when a silver substrate is used $|b_2| < |b_1|$. After $n$ dislocations
have passed, the accumulated Burgers vector left behind at the boundary is
$n(b_1 - b_2)$. This is illustrated in fig. 20. The energy at region CD can
increase in the slip plane until $n(b_1 - b_2) = b_2$ (or more probably $> \tfrac{1}{2} b_2$),
which would create a dislocation in the slip plane at CD. This dislocation
can then pass into and through the film and the situation shown in fig. 21


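Fig. 20. Film (free surface AB) on substrate with interface region CD. Labels: $nb_2$ (film), $nb_1$ (substrate), $n(b_1 - b_2)$ (interface).

Fig. 21. As fig. 20, with labels $(n+1)b_2$ (film) and $nb_1$ (substrate).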

would be realized. Then the process can be repeated by a further accu- 
mulation of Burgers vector at CD with a further creation of dislocations 
which can move through the film. The slip plane at CD is a region of misfit
between the substrate and film lattices which can be accommodated by
the formation of an accommodating dislocation network at the CD interface.
It is not possible for the Burgers vector residues left behind at CD


by the passage of dislocations from the substrate to the film to build up 
the accommodating network at CD and it seems essential for two types of 
dislocation to be formed at the interface at CD; one to relieve the 
increasing strain due to the Burgers vector residues left behind at the inter- 
face and the other to relieve the localized strains due to the mismatch 
between the two lattices on either side of the CD portion of the slip plane. 
When $|b_2| > |b_1|$, as in the case of an oriented gold film on a palladium
substrate, the same type of mechanism occurs at CD except that the extra
dislocation created by an accumulation of Burgers vector residues would 
have the opposite sign to those moving on the active slip plane of the 
substrate. This dislocation then combines with one of the mobile dis- 
locations and they annihilate so that (n+ 1) dislocations move up the sub- 
strate slip plane to CD and n dislocations emerge at the free surface AB. 
An accommodating network of dislocations would again be necessary in 
the portion of the slip plane at CD to relieve long range stresses due to the 
lattice mismatch on either side. The accumulation of the Burgers vector 
residues occurs in the active slip plane as does the accommodating network 
at the region CD. These dislocations can act as a strong barrier to the
passage of dislocations on an active slip plane as they lie in the active slip
plane, and the effect increases with the number of dislocations that have
passed from the substrate into the film. Again, the rate of increase of such a
hardening process is directly related to the degree of misfit of the two lattices since
both the number of dislocations $n$ required before a dislocation can be
created at CD and the spacing of the accommodating network decrease
as the mismatch increases. The portion of slip plane CD will be so small
that the normal network of accommodating dislocations for a (111) inter- 
face with trigonal symmetry will probably not be reached but some system 
seems to be necessary. 
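
As a rough illustration of how the misfit controls the number of dislocations that must pass before the residues add up to a full Burgers vector, the sketch below solves n(b1 - b2) = b2 for n. It assumes that the quoted misfit is defined as (b1 - b2)/b1, which the text does not state, and the function name is hypothetical.

```python
import math


def dislocations_before_residue(misfit, fraction=1.0):
    """Smallest whole n with n*(b1 - b2) >= fraction*b2.

    Assumes misfit = (b1 - b2)/b1 with b1 > b2 (an assumption about
    how the quoted percentages are defined), so that
    b2/(b1 - b2) = (1 - misfit)/misfit.
    """
    return math.ceil(fraction * (1.0 - misfit) / misfit)


# Misfits with respect to silver quoted in the text. The gold figure is
# an upper bound on the misfit, so its n is a lower bound.
for name, misfit in [("gold", 0.002), ("gold-palladium", 0.019),
                     ("platinum", 0.041), ("rhodium", 0.0737)]:
    n_full = dislocations_before_residue(misfit)
    n_half = dislocations_before_residue(misfit, fraction=0.5)
    print(f"{name:15s} n(full b2) = {n_full:4d}   n(half b2) = {n_half:4d}")
```

On this estimate n is of the order of the reciprocal of the misfit, so it falls from several hundred for gold to about ten for rhodium.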

Experimentally, it seems that, in the case of an oriented gold film on a 
silver substrate at least, the active slip planes become inoperative before 
the stage is reached where a dislocation is created at the interface due to 
the accumulation of Burgers vector residues. This is because it is observed 
that the gold films even as thin as 100 Å do not break due to sliding off on
the slip planes. This means either that the normal slip distance of silver
crystals is less than the order of 100 Å anyway or the barrier at the interface
due to the presence of an oriented gold film makes the active dislocation 
sources inoperative at an earlier stage and the slip steps are smaller. 

In the case of oriented thin films of the other materials used, it is possible 
for the stage to be reached where the accommodating network in the active 
slip plane is formed at the interface and where the residue of Burgers 
vectors creates an extra dislocation. 

As a general conclusion to this work it is considered that the mode of 
deformation of an oriented film on a substrate depends upon the ability of 
dislocations to be injected into the film from the underlying substrate. If 
enough dislocations can be injected, the film will deform in a ductile manner, 
if not, the film will crack. The experimental evidence indicates that the 


controlling factors are the degree of misfit and the relative elastic moduli 
of the film and substrate. Although the former is predominant no quanti- 
tative criterion for the maximum permissible misfit to obtain ductility 
can be given since the relative moduli of film and substrate undoubtedly 
influence the behaviour of the film. 


ACKNOWLEDGMENTS 


The authors wish to thank Dr. J. W. Menter for his guidance throughout 
this work and Drs. B. A. Bilby, A. J. Forty and D. W. Pashley for useful 
comments. This paper is published by permission of the Chairman of 
Tube Investments Limited. 


REFERENCES 


Andrade, E. N. da C., and Henderson, C., 1951, Phil. Trans., 244, 177.

Andrade, E. N. da C., and Randall, R. F. V., 1948, Nature, Lond., 162,
890; 1952, Proc. phys. Soc. Lond. B, 65, 445.

Barrett, C. S., 1953, Acta Met., 1, 2.

Cottrell, A. H., 1953, Dislocations and plastic flow in crystals (Oxford:
Clarendon Press).

Gilman, J. J., 1951, Trans. Amer. Inst. min. (metall.) Engrs, 191, 1148 (J.
Metals, N.Y., 3); 1955, A.S.T.M. Special Technical Publication No. 171,


Bi 

Gilman, J. J., and Read, T. A., 1952, Trans. Amer. Inst. min. (metall.) Engrs,
194 (J. Metals, N.Y., 4).

Harper, S., and Cottrell, A. H., 1950, Proc. phys. Soc. Lond. B, 63, 331.

Head, A. K., 1953, Phil. Mag., 44, 92.

Lipsett, F. R., and King, R., 1957, Proc. phys. Soc. Lond. B, 70, 608.

Menter, J. W., and Hall, E. O., 1950, Nature, Lond., 165, 611.

Pashley, D. W., 1958 (to be published).

Pickus, M. R., and Parker, E. R., 1951, Trans. Amer. Inst. min. (metall.)
Engrs, 191, 792 (J. Metals, N.Y., 3).

Pinner, R., 1953, Electroplating, 6, 401.

Roscoe, R., 1934, Nature, Lond., 133, 912.

van der Merwe, J. H., 1950, Proc. phys. Soc. Lond. A, 63, 616.

Whelan, M. J., and Hirsch, P. B., 1957, Phil. Mag., 2, 1303.

Whelan, M. J., Hirsch, P. B., Horne, R. W., and Bollmann, W., 1957,
Proc. roy. Soc. A, 240, 524.




Anharmonic Effects in the Theory of Solid Argon†


By I. J. Zucker


Wheatstone Physics Laboratory, King's College, Strand, W.C.2‡
[Received April 19, 1958]


ABSTRACT 


The thermodynamic properties of solid argon are evaluated theoretically 
using the Debye quasi-harmonic theory and an Einstein theory modified to
include anharmonic effects. The anharmonic theory gives better agreement 
with experiment except for the specific heat at low temperatures. 


§ 1. InTRODUCTION 


An excellent review of the theory and properties of solid argon has been
given by Dobbs and Jones (1957), and the purpose of this communication 
is to discuss in more detail some of the theoretical results given in that 
review. 

The inert gas crystals of which solid argon is a member have been the 
study of many theoretical investigations because of their simplicity. 
Recent experimental measurements of the properties of solid argon by 
Stewart (1955, 1956), Barker and Dobbs (1955) and Dobbs et al. (1956)
have provided useful material for comparing various models and theories 
concerning crystal lattices. Einstein (1907, 1911 a, b) put forward the first 
reasonably successful approximation to the thermodynamic behaviour of a 
crystal. It was assumed that the constituent particles of a crystal all 
vibrated independently of one another about their mean rest positions 
with the same frequency. Debye (1912) improved this model by taking 
into account in an approximate way the interdependence of the particles. 
A crystal was replaced in this approximation by an elastic continuum 
and the normal modes of vibration of the latter were assumed to represent 
the normal modes of the crystal lattice. More refined attacks deriving 
from the original work of Born and von Karman (1912, 1913) were based 
on solving the dynamical many body problem of a lattice. All these 
approaches had one feature in common. This was that the particles of a 
crystal lattice were constrained to their equilibrium positions by harmonic 
forces. This was equivalent to only considering terms up to the second 
order in the expansion of the potential energy as a Taylor series in the 
small displacements of the particles from their equilibrium positions. 
Although purely harmonic theories cannot account for anharmonic 
properties, it is possible to modify these theories by allowing parameters 


† Communicated by the Author.
‡ Now at Research Laboratories of the General Electric Company Limited,
Wembley, England.


such as the Debye value to vary with volume and this can take account of 
properties such as thermal expansion and compressibility. The term 
quasi-harmonic will be used in reference to such theories. 

Quasi-harmonic theories are found to be moderately successful in pre- 
dicting the properties of solid argon at low temperatures, but are much 
less successful at temperatures near the melting point. This suggests that 
some account of anharmonic terms in the potential energy should be made. 
The difficulty of extending the more refined theories of lattice vibrations 
to include anharmonic terms is formidable. Though Born (1951) and 
Hooton (1955 a, b) have made progress in this direction it is not possible to 
apply their results without making other assumptions. But for the 
Einstein model, Henkel (1955) has shown in a particular case how higher 
order terms in the potential energy may be evaluated. It is well known 
that thermodynamic properties of crystals evaluated using different 
harmonic theories give almost the same results. Thus although the 
Einstein model is rather crude, calculations made with it including
anharmonic terms should indicate differences between harmonic and
anharmonic theories. In this paper a comparison is made between results 
calculated by using a quasi-Debye model and an Einstein model including 
an anharmonic term. Henkel has already made some calculations but 
these will be repeated and amplified using what is believed to be a more 
accurate representation of $\phi(r)$, the potential energy between a pair of
argon atoms. This is

$$\phi(r) = \frac{\lambda}{r^{12}} - \frac{\mu}{r^{6}}, \qquad \lambda = 1.63 \times 10^{-7}\ \text{Å}^{12}\ \text{ergs}, \quad \mu = 1.05 \times 10^{-10}\ \text{Å}^{6}\ \text{ergs}, \tag{1}$$

and it is determined from solid state data alone, the heat of sublimation and
lattice constant at absolute zero being the information used. The method
of obtaining it and the reasons for this choice are given elsewhere (Domb
and Zucker 1956, Zucker 1956).
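
To see what the constants in (1) imply, the sketch below converts them to the more familiar well depth and collision diameter of a 12-6 potential. It assumes that (1) has the form φ(r) = λ/r¹² − μ/r⁶ with λ = 1.63 × 10⁻⁷ Å¹² ergs and μ = 1.05 × 10⁻¹⁰ Å⁶ ergs; the exponents are hard to read in this copy and are taken here as assumptions. The names `sigma`, `r_min` and `epsilon` are not used in the paper.

```python
K_BOLTZMANN = 1.380649e-16   # erg per kelvin

LAMBDA = 1.63e-7             # erg * angstrom**12 (assumed reading of eq. (1))
MU = 1.05e-10                # erg * angstrom**6  (assumed reading of eq. (1))


def phi(r):
    """Pair potential in ergs for a separation r in angstroms."""
    return LAMBDA / r ** 12 - MU / r ** 6


# phi(r) = 0 at sigma; the minimum lies at r_min = 2**(1/6) * sigma.
sigma = (LAMBDA / MU) ** (1.0 / 6.0)
r_min = (2.0 * LAMBDA / MU) ** (1.0 / 6.0)
epsilon = MU ** 2 / (4.0 * LAMBDA)          # well depth, ergs

print(f"sigma   = {sigma:.3f} A")
print(f"r_min   = {r_min:.3f} A")
print(f"epsilon = {epsilon:.3e} erg = {epsilon / K_BOLTZMANN:.1f} K * k")
```

With these readings the collision diameter comes out near 3.4 Å and the well depth near 120 K in temperature units, which is the usual order for argon and gives some confidence in the assumed exponents.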


§ 2. THEORY 


In the following sections the letter D or H in brackets following a symbol 
referring to a thermodynamic quantity implies that the latter is found by 
use of the quasi-Debye or Henkel theory respectively. 

The free energy of a Debye crystal is given by 


$$F(\mathrm{D}) = \tfrac{1}{2} N \sum_l \phi(r) + \tfrac{9}{8} R\theta_D + 9RT \left(\frac{T}{\theta_D}\right)^{3} \int_0^{\theta_D/T} x^{2} \log\left(1 - e^{-x}\right) dx. \tag{2}$$

$\sum_l$ indicates summation over all lattice points of a crystal. The first
term of (2) represents the static lattice energy, the second term the
zero-point energy and the last term the thermal energy. $\theta_D$ is defined as
$h\nu_D/k$† where $\nu_D$ is the maximum frequency of the elastic continuum
equivalent to the lattice. Domb and Salter (1952) have given a simple


† $h$ is Planck's constant and $k$ is Boltzmann's constant.


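method of finding $\theta_D$ as a function of volume. They relate $\nu_D^2$ to the sum
of the squares of the frequencies of the normal modes of a crystal. $\theta_D$ is
then found in terms of the force constants which are in turn given in
terms of $\phi(r)$. Thus

$$\theta_D^2 = \frac{5h^2N}{36\pi^2k^2M} \sum_l \left[\phi''(r) + \frac{2}{r}\phi'(r)\right] = \frac{5h^2N}{36\pi^2k^2M} \left[\frac{132\,C_{14}\lambda}{a^{14}} - \frac{30\,C_8\mu}{a^{8}}\right], \tag{3}$$

$$C_{14} = 12.06, \qquad C_8 = 12.80.$$

Here $a$ denotes the nearest neighbour distance and $M/N$ the mass of one atom.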

The $C_n$ are numbers obtained by summing inverse powers over all
lattice points of a face-centred cube in terms of the distance between
nearest neighbours. These were originally evaluated by Lennard-Jones
and Ingham (1925). The method of obtaining $\theta_D$ here is simpler and
probably more accurate than that proposed by Herzfeld and Goeppert-
Mayer (1934). It also provides a check on $\phi(r)$. Putting in the value of
the nearest neighbour distance at the absolute zero one finds $\theta_D = 81$, in
good agreement with the experimental value of 80.
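
The check on θ_D described above is easy to repeat numerically. The sketch below assumes that (3) reads θ_D² = (5h²/36π²k²m)[132 C₁₄λ/a¹⁴ − 30 C₈μ/a⁸], with m the mass of one argon atom, and that the constants of (1) are λ = 1.63 × 10⁻⁷ Å¹² ergs and μ = 1.05 × 10⁻¹⁰ Å⁶ ergs. The nearest neighbour distance of 3.756 Å, the atomic mass and the physical constants are values supplied here and are not taken from the text.

```python
import math

H = 6.62607e-27        # Planck's constant, erg s
K = 1.380649e-16       # Boltzmann's constant, erg/K
M_ATOM = 39.948 / 6.02214e23   # mass of one argon atom, g (assumed value)

LAMBDA = 1.63e-7       # erg A^12 (assumed reading of eq. (1))
MU = 1.05e-10          # erg A^6  (assumed reading of eq. (1))
C14, C8 = 12.06, 12.80 # lattice sums quoted in the text
A0 = 3.756             # nearest neighbour distance at 0 K, angstroms (assumed)


def theta_debye(a):
    """Debye temperature in kelvin for nearest neighbour distance a (angstroms)."""
    # Lattice sum of phi'' + (2/r) phi', in erg per square angstrom.
    lattice_sum = 132.0 * C14 * LAMBDA / a ** 14 - 30.0 * C8 * MU / a ** 8
    lattice_sum *= 1.0e16            # convert to erg per square centimetre
    theta_sq = 5.0 * H ** 2 * lattice_sum / (36.0 * math.pi ** 2 * K ** 2 * M_ATOM)
    return math.sqrt(theta_sq)


print(f"theta_D = {theta_debye(A0):.1f} K")
```

With these inputs the result falls close to the values of 80 and 81 quoted in the text; it is fairly sensitive to the assumed nearest neighbour distance.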

From (3) all thermodynamic quantities may be determined employing 
the usual thermodynamic relations. 

Henkel extended the Einstein theory by finding the potential energy of 
an atom with respect to all the others up to the fourth order term. He 
did this for a particular form of $\phi(r)$ for a face-centred cubic lattice. This
may be generalized to any central additive potential and to any lattice. 
The potential energy of an atom is found to be 


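$$V = P_0 + P_2(x^2 + y^2 + z^2) + P_4(x^4 + y^4 + z^4),$$

$$P_0 = \sum_l \phi(r), \qquad P_2 = \frac{1}{3!} \sum_l \left[\phi''(r) + \frac{2}{r}\phi'(r)\right], \qquad P_4 = \frac{1}{5!} \sum_l \left[\phi''''(r) + \frac{4}{r}\phi'''(r)\right]. \tag{4}$$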


The energy levels for an atom with potential energy given by (4) are easily
found since the Schrödinger equation is separable into three equations
each of the form

$$-\frac{h^2}{8\pi^2 m}\frac{\partial^2\psi}{\partial x^2} + \left(P_2x^2 + P_4x^4\right)\psi = \epsilon\psi. \tag{5}$$

† It will be observed that $P_2$ appears in the formula for $\theta_D$. Indeed $6P_2$
is the trace of the $3N \times 3N$ matrix obtained by lattice dynamics, the
eigenvalues of which determine the squares of the frequencies of the normal modes
of the lattice. The difference between the ordinary Einstein and Debye
theories is due to the different ways of evaluating the respective characteristic
temperatures by relating the respective frequency distributions to $6P_2$. The
difference between $\theta_E$ and $\theta_D$ found in this manner is only a numerical factor;
in fact $\theta_D^2 = \tfrac{5}{3}\theta_E^2$. Since the volume dependence is obviously the same the
only significant difference between the two theories employing characteristic
temperatures calculated as indicated is in the specific heat at low
temperatures. Here the usual differences between the Einstein and Debye functions
exist.


This may be solved simply by treating the $P_4x^4$ term as a perturbation
whence the free energy per mole of a crystal in the Henkel approximation
becomes


wo l ~ 
F(H)=4N > ,d(r) + “\W- 3RT log > exp (- er) (nW +n2Y), 
n=0 
h 9 Hed 1/2 3h2N2 iar 
=—-—-l eo —— = ae 6 
ie = (57) 672M P, (6) 
It will be observed that when $P_4$ is zero the result becomes that for the
free energy of an ordinary Einstein crystal, and $W/k$ may be identified as
the Einstein characteristic temperature.
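
For orientation, the first-order perturbation result for a single mode of (5) can be written out. The following is the standard textbook result for a quartic perturbation of a harmonic oscillator, given here as a sketch; the symbols $\omega$ and $E_n$ are introduced for illustration, and the grouping of terms into $W$ and $Y$ in (6) may differ from it.

```latex
% Unperturbed oscillator: P_2 x^2 = (1/2) m \omega^2 x^2
\omega = \left(\frac{2P_2}{m}\right)^{1/2}, \qquad
E_n^{(0)} = \left(n + \tfrac{1}{2}\right)\frac{h\omega}{2\pi}.

% First-order shift from P_4 x^4, using
% <n| x^4 |n> = 3 (h / 4\pi m\omega)^2 (2n^2 + 2n + 1)
E_n \approx \left(n + \tfrac{1}{2}\right)\frac{h\omega}{2\pi}
  + \frac{3h^2 P_4}{32\pi^2 m P_2}\left(2n^2 + 2n + 1\right).
```

The levels are therefore quadratic in $n$, with the coefficient of $n^2$ equal to $3h^2P_4/16\pi^2 m P_2$, and they reduce to the evenly spaced Einstein levels when $P_4$ is zero.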


§ 3. EVALUATION OF THE THERMODYNAMIC PROPERTIES OF SOLID ARGON 


3.1. Density as a Function of Temperature 


Finding the density is equivalent to finding the equilibrium molar volume
under zero pressure at various temperatures. Thus $P = -(\partial F/\partial V)_T$
was found from eqns. (2) and (6). The value of $V$ which made $P$ zero
was found in both cases at 10° intervals from 0° to 80°K.
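
The procedure of locating the volume at which the pressure vanishes can be illustrated numerically. The sketch below is not the calculation of the paper: it uses a plain quasi-harmonic Einstein free energy per atom (no quartic term), assumed readings of the constants in (1), the standard face-centred cubic lattice sums C₆, C₈, C₁₂ and C₁₄, and a simple scan in nearest neighbour distance a for the first minimum of F, which is equivalent to P = 0. All names are illustrative.

```python
import math

HBAR = 1.054572e-27    # erg s
K = 1.380649e-16       # erg/K
M_ATOM = 39.948 / 6.02214e23   # g, assumed atomic mass of argon

LAMBDA, MU = 1.63e-7, 1.05e-10           # erg A^12, erg A^6 (assumed readings)
C6, C8, C12, C14 = 14.45392, 12.80194, 12.13188, 12.05899  # fcc lattice sums


def free_energy(a, temp):
    """Quasi-harmonic Einstein free energy per atom (ergs); a in angstroms."""
    static = 0.5 * (C12 * LAMBDA / a ** 12 - C6 * MU / a ** 6)
    # P_2 = (1/6) * sum(phi'' + 2 phi'/r), converted from erg/A^2 to erg/cm^2.
    p2 = (132.0 * C14 * LAMBDA / a ** 14 - 30.0 * C8 * MU / a ** 8) / 6.0 * 1.0e16
    quantum = HBAR * math.sqrt(2.0 * p2 / M_ATOM)      # Einstein level spacing
    thermal = 0.0
    if temp > 0.0:
        thermal = 3.0 * K * temp * math.log(1.0 - math.exp(-quantum / (K * temp)))
    return static + 1.5 * quantum + thermal


def equilibrium_distance(temp, a_start=3.60, a_stop=4.20, step=1.0e-4):
    """First local minimum of F(a) on a grid; None if F keeps falling."""
    a, f_prev = a_start, free_energy(a_start, temp)
    while a + step < a_stop:
        f_next = free_energy(a + step, temp)
        if f_next > f_prev:
            return a
        a, f_prev = a + step, f_next
    return None


for temp in (0.0, 20.0, 40.0, 60.0):
    print(temp, equilibrium_distance(temp))
```

For a face-centred cubic lattice the molar volume follows from the nearest neighbour distance as V = N a³/√2, so each equilibrium distance converts directly to a density. A return value of `None` would correspond to the situation described later in the text, where no equilibrium volume exists at zero pressure.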


3.2. Isothermal Compressibility $K_T$ at $P = 0$


$K_T$ was found using the relation $K_T = -\frac{1}{V}\left(\frac{\partial V}{\partial P}\right)_T$. The values of $V$
found already were used.


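3.3. Expansivity $\alpha$ as a Function of Temperature


$\alpha$ is equal to $\frac{1}{V}\left(\frac{\partial V}{\partial T}\right)_P$, and this is equivalent to $K_T \left(\frac{\partial P}{\partial T}\right)_V$. $K_T$
having been evaluated, $(\partial P/\partial T)_V$ was calculated, and hence $\alpha$.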
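
The equivalence used here follows from the cyclic relation between partial derivatives. A short derivation is sketched below, using only the definitions of the expansivity and the isothermal compressibility given in the text.

```latex
\left(\frac{\partial V}{\partial T}\right)_P
\left(\frac{\partial T}{\partial P}\right)_V
\left(\frac{\partial P}{\partial V}\right)_T = -1
\;\Longrightarrow\;
\left(\frac{\partial V}{\partial T}\right)_P
  = -\left(\frac{\partial V}{\partial P}\right)_T
     \left(\frac{\partial P}{\partial T}\right)_V ,

\alpha = \frac{1}{V}\left(\frac{\partial V}{\partial T}\right)_P
       = \left[-\frac{1}{V}\left(\frac{\partial V}{\partial P}\right)_T\right]
         \left(\frac{\partial P}{\partial T}\right)_V
       = K_T \left(\frac{\partial P}{\partial T}\right)_V .
```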


3.4. Specific Heats as Functions of Temperature


$C_v$ is given by $-T\left(\frac{\partial^2 F}{\partial T^2}\right)_V$, and $C_p$ by

$$C_p = C_v + \frac{TV\alpha^2}{K_T}.$$

$C_p$ was evaluated for both theories using the respective values of $C_v$, $\alpha$,
$V$ and $K_T$ already found.
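
As an illustration of the size of this correction, the sketch below evaluates the standard thermodynamic difference between the two specific heats, TVα²/K_T, in joules per mole per degree. The input values are rough, illustrative figures of the order of those tabulated for the Henkel theory near 40°K (the volume and compressibility in particular are interpolated by eye and should be treated as assumptions), and the function name is hypothetical.

```python
ERG_PER_JOULE = 1.0e7


def cp_minus_cv(temp, molar_volume, alpha, k_t):
    """C_p - C_v in joules per mole per degree.

    temp          temperature, K
    molar_volume  cm^3 per mole
    alpha         volume expansivity, 1/K
    k_t           isothermal compressibility, cm^2/dyne
    """
    difference_erg = temp * molar_volume * alpha ** 2 / k_t
    return difference_erg / ERG_PER_JOULE


# Illustrative inputs only (assumed, of the order of the tabulated values).
print(round(cp_minus_cv(40.0, 23.1, 11.23e-4, 0.5e-10), 2))
```

With these inputs the difference is a little over 2 joules per mole per degree, the same order as the difference between the tabulated Henkel values of the two specific heats at 40°K.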

$K_T$ was also found as a function of pressure at five different temperatures
but only the Henkel equations were employed in this case. All these 
results have been illustrated graphically in figs. 1-6 and tabulated below. 


§ 4. Discussion of Results

4.1. $\rho$ vs $T$


Since $\rho$ at $T = 0$ is one of the parameters used in determining $\phi(r)$ it is
not surprising that the theoretical and experimental values agree at $T = 0$°.
This point is in fact the only fixed point in all the calculations described


above. In fig. 1 it is seen that the differences between $\rho$(D) and $\rho$(H) are
small, but increase rapidly with temperature. $\rho$(H) is undoubtedly in
better agreement with the experimental values of Dobbs et al. (1956).
The $\rho$(D) are all too small and in fact it was found that at 80°K there was
no volume at which the crystal was in equilibrium when the pressure is
zero. The value of $\rho$(D) given is that which makes the pressure a minimum.
It may be deduced from this fact that a harmonic crystal is more expanded
and less stable than an anharmonic crystal. The reason is that for a
given energy the amplitude of a harmonic oscillator is greater than for an
anharmonic oscillator.

Fig. 1. Density against temperature. Solid line: Henkel theory. Broken line: Debye theory. ○ Experimental results (Dobbs et al.).

4.2. $K_T$ vs $T$

At low temperatures there is little difference between $K_T$(H) and $K_T$(D)
but large differences occur near the melting point where $K_T$(D) tends to
infinity†. The few experimental results available favour $K_T$(H) rather


† Herzfeld and Goeppert-Mayer (1934) and Kane (1939) identify this with
the melting point, but Frenkel (1946) points out that it only defines a certain
condition of stability of a solid.


than $K_T$(D). Barker and Dobbs (1955) found $K_T$ by measuring the
velocity of ultrasonic waves in solid argon whilst Stewart (1955, 1956)
found $K_T$ using the piston displacement method. Although Stewart's
values at 65°K agree with Barker and Dobbs', there is a large discrepancy
at 77°K, but as Stewart himself points out his method of finding $K_T$ is not
trustworthy when $K_T$ varies rapidly as it does at high temperature and
low pressures. It should be pointed out that the values for $K_T$(H) given
here are slightly different from those originally communicated privately
to Dobbs and Jones but this does not affect their conclusions.

Fig. 2. Isothermal compressibility against temperature. Solid line: Henkel theory. Broken line: Debye theory. ○ Experimental results (Stewart). ● Experimental results (Barker and Dobbs).


4.3. $\alpha$ vs $T$


Again differences between $\alpha$(D) and $\alpha$(H) become appreciable at high
temperatures and again $\alpha$(H) agrees much better with experiment. The
values of $\alpha$(H) given here are also slightly different from those published
by Dobbs and Jones but again make no difference in interpretation.


Fig. 3. Volume expansivity against temperature. Solid line: Henkel theory. Broken line: Debye theory. ○ Experimental results (Dobbs et al.).


4.4. $C_v$ and $C_p$ vs $T$


At low temperatures $C_v$(D) is slightly greater than $C_v$(H). As the
Henkel theory is just a modified Einstein theory this is just as might be
expected. But as the temperature rises $C_v$(H) does not approach $3R$ as
$C_v$(D) and the Einstein specific heat do. This is undoubtedly due to the
effect of anharmonicities which are expected to be more prominent at more
elevated temperatures.

The $C_p$ curves show that at low temperatures $C_p$(D) is in better agreement
with experiment, whilst at high temperatures $C_p$(H) is better. This
may be interpreted as follows. It is known that at low temperatures
the ordinary Einstein theory does not predict the behaviour of specific
heat as well as does the Debye theory. Further, as has already been seen,
anharmonic effects are small at low temperatures. Hence as the Henkel
theory only modifies the Einstein theory by the inclusion of anharmonic
effects, it might be expected that $C_p$(D) at low temperatures represents
the experimental facts more closely. But as the temperature rises to
values greater than $\theta_D/5$ (~16°K for argon) the differences between the
Debye and the ordinary Einstein specific heats rapidly approach zero.
Thus any difference between $C_p$(D) and $C_p$(H) at high temperatures must
be attributed to anharmonicity, and here it is observed that the anharmonic
theory represents $C_p$ more closely.


Fig. 4. Specific heat at constant volume against temperature. Solid line: Henkel theory. Broken line: Debye theory. Ordinate: $C_v$ in ergs/gm. mol × 10⁷; abscissa: $T$ in °K.
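
The statement that the Debye and ordinary Einstein specific heats converge as the temperature rises can be checked directly. The sketch below uses the textbook harmonic Einstein and Debye functions with a fixed Debye temperature of 81 and an Einstein temperature related to it by θ_D² = (5/3)θ_E², as in the footnote to § 2. The volume dependence of the characteristic temperatures and all anharmonic terms are ignored, so the numbers are not those of Table 1.

```python
import math

R_GAS = 8.314          # joules per mole per kelvin
THETA_D = 81.0         # value quoted in the text
THETA_E = THETA_D * math.sqrt(3.0 / 5.0)


def cv_einstein(temp, theta=THETA_E):
    """Harmonic Einstein specific heat, J/(mol K)."""
    x = theta / temp
    return 3.0 * R_GAS * x ** 2 * math.exp(x) / (math.exp(x) - 1.0) ** 2


def cv_debye(temp, theta=THETA_D, steps=2000):
    """Harmonic Debye specific heat by midpoint integration, J/(mol K)."""
    upper = theta / temp
    h = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += x ** 4 * math.exp(x) / (math.exp(x) - 1.0) ** 2
    return 9.0 * R_GAS * (temp / theta) ** 3 * total * h


for temp in (5.0, 10.0, 16.0, 40.0, 80.0):
    print(temp, round(cv_debye(temp), 2), round(cv_einstein(temp), 2))
```

At the lowest temperatures the Debye value is much the larger of the two, while near 80°K the two harmonic values are almost indistinguishable, so a remaining gap between the Debye and Henkel results at high temperature cannot come from the choice of harmonic model.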


Fig. 5. Specific heat at constant pressure against temperature. Solid line: Henkel theory. Broken line: Debye theory. ○ Experimental results (Clusius). Ordinate: $C_p$ in ergs/gm. mol × 10⁷; abscissa: $T$ in °K.


4.5. $K_T$ as a Function of Pressure


It was found that at pressures greater than 6000 bars (1 bar = 0.98
atmospheres) values of $K_T$ at different temperatures became
indistinguishable. The differences between $K_T$(H) and $K_T$(D) also rapidly
approached zero, and the latter have not been illustrated here. The
agreement with Stewart's results is good except at low pressures, and here the
criticisms of § 4.2 apply.


Fig. 6. Isothermal compressibility as a function of pressure (abscissa: pressure in kg/sq. cm, 0 to 6000). Four curves from the Henkel theory, for $K_{T=77}$, $K_{T=65}$, $K_{T=40}$ and $K_{T=20}$. ○, ● Experimental results (Stewart).


Table 1 (a)


K_T(D)        K_T(H)
dynes^-1      dynes^-1


22.68
22.88
23.18
23.56
24.06
24.79
26.64


Table 1 (b)

T     C_v(D)        C_v(H)        C_p(D)        C_p(H)        α(D)          α(H)
      joules/mole   joules/mole   joules/mole   joules/mole   × 10^-4       × 10^-4

 0     0.0           0.0           0.0           0.0           0.0           0.0
10     3.44          1.54          3.46          1.57          [illegible]   0.73
20    12.72         10.97         13.27         11.32          6.91          5.48
30    18.35         16.82         20.23         18.15         10.83          9.06
40    21.05         19.96         25.51         22.48         14.39         11.23
50    22.61         20.53         29.49         24.43         18.78         13.07
60    23.48         21.28         34.82         26.56         24.28         14.53
70    24.08         21.27         39.37         28.31         [illegible]   16.31
80    24.03         20.97                       29.33                       17.36
Table 2

Molar volume (cm^3)   24.91    23.13    22.54          21.36    20.17    19.00

T = 0     P           -1790    -1090    [illegible]    1880     4800      9430
          K_T(H)      0.907    0.565    0.367          0.243    0.162    0.108

T = 20    P           -1650     -980    160            1940     4840      9440
          K_T(H)      0.955    0.584    0.372          0.245    0.163    0.108

T = 40    P           -1210     -540    [illegible]    2310     5160      9710
          K_T(H)      0.966    0.596    0.380          0.249    0.165    0.110

T = 65    P            -630       80    1210           2960     5800     10330
          K_T(H)      0.894    0.580    0.377          0.250    0.166    0.110

T = 77    P            -350      370    1530           3290     6130     10700
          K_T(H)      0.850    0.566    0.373          0.248    0.166    0.110


P, the pressure, is given in bars or dynes × 10^6/cm^2.
K_T, the isothermal compressibility, is given in units of cm^2/dyne × 10^-10.




§ 5. CONCLUSION 


Various properties of solid argon have been evaluated employing two 
theories—the Debye theory and an anharmonic theory developed by 
Henkel from the Einstein model of a crystal. In all cases except for 
specific heats at low temperatures the anharmonic theory gives results 
agreeing more closely with available experimental data. This better 
agreement is most prominent at high temperatures. It would appear 
that anharmonic effects play a prominent part in determining the be- 
haviour of solid argon, especially near the melting point. It is worth 
repeating the observations of Dobbs and Jones that more experimental 
data especially at low temperatures are required, and that an anharmonic 
theory based on a better model than that of Einstein is desirable. 


ACKNOWLEDGMENTS 


The writer is grateful to Professor C. Domb of King’s College, London, 
for many helpful discussions, and to Professor G. O. Jones and Dr. E. R. 
Dobbs of Queen Mary College, London, who communicated many results 
before publication. He is also indebted to D.S.I.R. for a maintenance 
grant for the period during which this work was done. 


REFERENCES 


Barker, J. R., and Dobbs, E. R., 1955, Phil. Mag., 46, 1069.
Born, M., 1951, Fest. Gött. Akad., math.-phys. Kl., 1.
Born, M., and von Kármán, T., 1912, Z. Phys., 12, 297; 1913, Ibid., 14, 15.
Clusius, K., 1936, Z. phys. Chem. B, 81, 459.
Debye, P., 1912, Ann. Phys., 39, 789.
Dobbs, E. R., Figgins, B. F., Jones, G. O., Piercey, D. C., and Riley, D. P., 1956, Nature, Lond., 178, 483.
Dobbs, E. R., and Jones, G. O., 1957, Rep. Prog. Phys., 20, 516.
Domb, C., and Salter, L., 1952, Phil. Mag., 43, 1083.
Domb, C., and Zucker, I. J., 1956, Nature, Lond., 178, 484.
Einstein, A., 1907, Ann. Phys., 22, 180; 1911 a, Ibid., 34, 170; 1911 b, Ibid., 35, 679.
Frenkel, J., 1946, Kinetic Theory of Liquids (Oxford: University Press).
Henkel, J. H., 1955, J. chem. Phys., 23, 681.
Hooton, D. J., 1955 a, Phil. Mag., 46, 422; 1955 b, Ibid., 46, 433.
Herzfeld, K. F., and Goeppert-Mayer, M., 1934, Phys. Rev., 46, 995.
Kane, G., 1939, J. chem. Phys., 7, 603.
Lennard-Jones, J. E., and Ingham, A. E., 1925, Proc. roy. Soc. A, 107, 146.
Stewart, J. W., 1955, Phys. Rev., 97, 578; 1956, J. Phys. Chem. Solids, 1, 636.
Zucker, I. J., 1956, J. chem. Phys., 25, 915.




Some Magnetic Properties of Dilute Ferromagnetic Alloys II†


By B. W. Lothian, A. C. Robinson and W. Sucksmith
Department of Physics, University of Sheffield 


[Received May 21, 1958] 


ABSTRACT 


Previous experiments (Bate, G., Schofield, D. and Sucksmith, W. 1955) 
have been carried out on the magnetic properties of precipitates of dilute 
ferromagnetic alloys precipitated from solid solution in a non-ferromagnetic 
matrix. This process could be followed from the initial stages of
superparamagnetism through single to multi-domain size of the aggregates, and the
magnetic measurements correlated with particle growth. In the present
communication, the work is extended to the production of precipitates with 
greater departure from spherical shape produced by cold drawing of suitable 
alloys of the ferromagnetics iron, nickel and cobalt. Magnetic measurements 
on the anisotropic specimens so produced are shown to give evidence for the 
distribution of particle shape, size and structure. The reverse magnetic 
field required to reduce the remanence to zero is shown to be an additional 
useful parameter in these determinations. 


§ 1. INTRODUCTION 


In their bulk polycrystalline forms, the ferromagnetic elements iron,
nickel and cobalt are magnetically soft, having coercive forces of no more
than a few oersteds. However, if these elements are subdivided into
sufficiently small particles they may exhibit coercive forces of several
hundred oersteds, and in certain cases even thousands. If such particles
are sufficiently small to exist as single domains, changes in magnetization 
may only occur by the difficult process of rotation of the magnetization 
vector and not by the easy process of boundary displacement. The high 
coercive forces observed are due to the forces of anisotropy of the particles 
opposing this rotation of the magnetization vectors. The theoretical 
maximum values of coercivity associated with the three forms of aniso- 
tropy are shown in table 1 and are due to the work of Stoner and Wohlfarth 
(1948), Néel (1947a) and Kittel (1949). 

Numerous experimental investigations of the magnetic properties of 
finely divided ferromagnetic powders have been carried out. Bertaut 
(1953) and more recently Meiklejohn (1953) studied the dependence of
coercive force on particle size and both obtained results supporting the 
theoretical work of Néel (1947b). Considerable information regarding 
the magnetic properties of small particles has been obtained by studying 
a simple model consisting of small amounts of the ferromagnetic elements 


† Communicated by the Authors.




dispersed in a non-magnetic matrix. Such a system may be realized by
the precipitation of small ferromagnetic regions in alloys from
supersaturated solid solutions by annealing. Such solid solutions of a
ferromagnetic and a non-ferromagnetic element should have a low solubility
limit for the ferromagnetic element so that interaction between the
ferromagnetic elements does not occur. Thus, a detailed investigation
of the variation of the magnetic properties of copper-rich copper-iron
and copper-cobalt alloys with annealing after quenching was
carried out by Bate et al. (1955). After quenching, these alloys, which
contained from 0 to 2% of the ferromagnetic constituent, were not
ferromagnetic. However, during annealing at temperatures ranging from
350°C to 500°C for the copper-cobalt alloys and from 500°C to 800°C for
the copper-iron alloys, these alloys became weakly ferromagnetic,
developing both remanence and coercivity. With continued annealing, the
coercivities of these alloys increased to maximum values of 250 oe and
400 oe for the copper-cobalt and copper-iron alloys respectively, at which
stage the largest proportion of regions behaving as single domains was
present. The formation of multidomain particles by further precipitation
was accompanied by a decrease in the coercivity of the alloys.


Table 1


              Crystal               Shape                Strain
          random  oriented     random  oriented     random  oriented

Iron        160      500        5100    10700         300      600
Nickel       60      185        1450     3000        2000     4000
Cobalt     2000     6000        4300     8900         300      600


All values are shown in oersteds.
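
The shape-anisotropy entries of table 1 can be checked against the Stoner and Wohlfarth model, in which an aligned assembly of very elongated single-domain particles has a limiting coercivity of 2πM_s and a randomly oriented assembly has 0.479 of that value. The sketch below is illustrative only: the saturation magnetizations are assumed handbook-style room-temperature values in e.m.u./cm^3, not figures taken from this paper, and the function name is hypothetical.

```python
import math

# Assumed room-temperature saturation magnetizations (e.m.u./cm^3).
# Handbook-style values chosen for illustration; they are not data from the paper.
SATURATION_MAGNETIZATION = {"iron": 1714.0, "nickel": 485.0, "cobalt": 1422.0}

# Shape-anisotropy entries of table 1 in oersteds: (random, oriented).
TABLE_1_SHAPE = {
    "iron": (5100.0, 10700.0),
    "nickel": (1450.0, 3000.0),
    "cobalt": (4300.0, 8900.0),
}

RANDOM_FACTOR = 0.479  # Stoner-Wohlfarth factor for a random uniaxial assembly


def shape_coercivity_limits(m_s):
    """Return (random, oriented) limiting shape coercivities in oersteds.

    oriented = 2*pi*M_s is the limit for very long prolate particles aligned
    with the field; random = 0.479 * oriented.
    """
    oriented = 2.0 * math.pi * m_s
    return RANDOM_FACTOR * oriented, oriented


if __name__ == "__main__":
    for metal, m_s in SATURATION_MAGNETIZATION.items():
        random_hc, oriented_hc = shape_coercivity_limits(m_s)
        table_random, table_oriented = TABLE_1_SHAPE[metal]
        print(f"{metal:7s} random {random_hc:7.0f} (table {table_random:.0f})  "
              f"oriented {oriented_hc:7.0f} (table {table_oriented:.0f})")
```

With these assumed magnetizations the computed limits fall within a few per cent of the tabulated shape values, which suggests the table was compiled on this basis.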


Although the maximum coercive forces observed for the copper-cobalt
and copper-iron alloys were only 250 oe and 400 oe respectively, it was
found that the reverse field required to reduce the remanence to zero was
usually greater than 1000 oe and often as high as 1500 oe for both alloys.
In ordinary ferromagnetic materials, this reverse field (H_r) is usually not
more than about 30% greater than the coercive force (H_c). A high value
of the ratio H_r/H_c may be taken to indicate that the hysteresis loop is the
resultant of high and low coercivity contributions. Furthermore, the ratio of the
remanent magnetization to the saturation value provides information
concerning the distribution of the magnetization vectors amongst the
contributing particles.

The work described in the present paper dealt with three dilute
ferromagnetic alloys consisting of iron in copper, nickel in gold and cobalt in
copper. In order to modify the shape and orientation of the domains,
suitable specimens of the alloys were cold drawn, the magnetic properties
being investigated at various stages of reduction.




§ 2. EXPERIMENTAL TECHNIQUES 
2.1. Preparation of the Alloys 


The alloys were prepared in 100 gram melts by melting the 99.9% pure
constituents in a high frequency induction furnace in an atmosphere of
argon at a pressure of 5 cm of mercury. Alumina crucibles were used in
the preparation of the copper-iron and gold-nickel alloys, whilst the
crucibles used for the copper-cobalt alloys were made from pure Acheson
graphite.

2.2. Heat Treatment 


The copper-iron and copper-cobalt alloys were forged at bright red
heat to bars approximately 1 cm in diameter and 7 cm in length. Forging
at bright red heat of the gold-nickel alloys so prepared was found to produce
large cracks. Consequently, the alternative procedure of melting these
alloys in a mould of the required shape in the hearth of an argon arc furnace
was adopted. The bars obtained by these processes were then heat treated
at 1000°C before quenching. Initially, all the alloys were quenched in
water, but in the case of the copper-cobalt alloys this was found to produce
surface strains which affected the rate of precipitation. This was remedied
in subsequent work by oil quenching.

Investigations of the effect of further heat treatment on the magnetic
properties of these alloys were carried out on specimens in the form of
discs 5 mm in diameter and 0.5 mm thick, these having been annealed and
quenched to remove surface strain produced in preparation. The
appropriate heat treatments were carried out with the specimens sealed
into small evacuated hard glass tubes.

After the bar had been heat treated, it was turned down to 8 mm diameter
and threaded at one end in preparation for cold drawing. Held by means
of a threaded steel tube and steel rod, the bar was cold drawn through
dies. By this method the diameter of the bar was reduced in steps of
½ mm to 4 mm and then in ¼ mm steps down to 2 mm. At various stages
of reduction, specimens were cut from the bar with the plane of the disc
parallel to the direction of drawing. A small arrow was scratched on the
surface of each disc to coincide with the direction of drawing.
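
The amount of cold work is quoted later in the paper as a percentage reduction in cross-sectional area. Assuming the bar remains circular and starts from the 8 mm diameter stated above, the reduction follows from the square of the diameter ratio. The function name below is hypothetical and the diameters in the loop are illustrative.

```python
def percent_reduction_in_area(initial_diameter_mm, drawn_diameter_mm):
    """Percentage reduction in cross-sectional area of a round bar.

    Area scales as the diameter squared, so the reduction is
    100 * (1 - (d / d0)**2).
    """
    if initial_diameter_mm <= 0 or drawn_diameter_mm <= 0:
        raise ValueError("diameters must be positive")
    ratio = drawn_diameter_mm / initial_diameter_mm
    return 100.0 * (1.0 - ratio * ratio)


if __name__ == "__main__":
    start = 8.0  # mm, diameter of the bar before drawing
    for diameter in (8.0, 6.0, 4.0, 3.0, 2.0):
        reduction = percent_reduction_in_area(start, diameter)
        print(f"{diameter:4.1f} mm  ->  {reduction:5.1f} % reduction in area")
```

On this basis a bar drawn from 8 mm to 4 mm has lost three quarters of its cross-section, even though its diameter has only halved.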


2.3. Magnetic Measurements 
2.3.1. Determination of the hysteresis loops 


The demagnetization curves were determined by measuring the tractive
force on a magnetized specimen produced by means of a field gradient.
This was achieved by means of a torsion balance described by Bate et al.
(1955).

The force F exerted on a specimen of mass M and intensity of
magnetization σ in a field gradient dH/dZ is given by

    F = σM (dH/dZ).




The coils magnetizing the specimen to its intensity σ and those producing
the necessary field gradient are separately controlled and collinear.
This force is balanced by the couple exerted by the torsion fibre of
constant c, and thus we obtain

    cθ = σM (dH/dZ) d,

where θ is the angle of rotation of the torsion head necessary to restore the
balance arm of length d to its zero position.
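
As a minimal sketch of how a reading is reduced, the balance condition cθ = σM(dH/dZ)d can be solved for σ. The function name, the argument names and the numbers in the usage lines are hypothetical; c.g.s. units are assumed (c in dyne cm per radian, dH/dZ in oersteds per cm, σ in e.m.u. per gram).

```python
def specific_magnetization(torsion_constant, theta_rad, mass_g,
                           field_gradient, arm_length_cm):
    """Solve c*theta = sigma*M*(dH/dZ)*d for sigma (e.m.u. per gram).

    torsion_constant : c, restoring couple per radian of twist (dyne cm / rad)
    theta_rad        : rotation of the torsion head needed to restore balance
    mass_g           : specimen mass M
    field_gradient   : dH/dZ at the specimen (oersted / cm)
    arm_length_cm    : length d of the balance arm
    """
    denominator = mass_g * field_gradient * arm_length_cm
    if denominator == 0:
        raise ValueError("mass, field gradient and arm length must be non-zero")
    return torsion_constant * theta_rad / denominator


if __name__ == "__main__":
    # Purely illustrative numbers, not measurements from the paper.
    sigma = specific_magnetization(2.0, 0.5, 0.1, 50.0, 10.0)
    print(f"sigma = {sigma:.3f} e.m.u./g")
```

Because σ is proportional to θ for a fixed gradient, a demagnetization curve can be traced simply by recording the restoring angle as the magnetizing field is varied.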

The arrangement of the apparatus and experimental technique were 
similar to those described by Bate et al. 


2.4. Saturation Measurements 
The intensity of magnetization of specimens in fields of up to 18 000 oe
was determined using a Sucksmith magnetic ring balance (Sucksmith
1929).
§ 3. EXPERIMENTAL RESULTS 
3.1. Copper-Iron Alloys
The investigations of the effects of heat treatment and of cold work
after heat treatment on the magnetic properties of copper-rich copper-iron
alloys were carried out on alloys containing 0.5%, 1.0% and 2.0% by
weight of iron. After annealing at 1000°C for several hours followed by
quenching, the 0.5% and 1.0% alloys were not ferromagnetic but the
2.0% alloy showed slight ferromagnetism, having a remanence of σ_R = 0.015
(e.m.u. per gram). If all the iron was dissolved by the heat treatment,
as would be expected from the equilibrium diagram, some was precipitated
as body-centred iron on quenching.
The effect of further heat treatment at 650°C on the magnetic properties
of these alloys was then studied and the results are summarized in table 2.


Table 2. The Effect of Heat Treatment at 650°C on the Coercivity and
Field Required to Reduce Remanence to Zero


Annealing time in hours
½% Fe-Cu
1% Fe-Cu
2% Fe-Cu


As seen in table 2 the initial coercive force of these alloys in general 
increased to a maximum after about 5 hours heat treatment and then 
decreased with further heat treatment. Similar variations of the field 
required to reduce remanence to zero were observed but the period of 




heat treatment required to give the maximum values was found to be
dependent on the composition of the alloy. Although the maximum
values of H_c and H_r were almost identical with those observed by Bate
et al. (1955), the periods required to achieve these maxima were much
shorter in the present work, e.g. the maximum value for the coercivity of
a 1% alloy was observed by Bate et al. after 65 hours at 650°C, compared
with a period of only 5 hours in the present work. This difference in the
rates of precipitation is due to a difference in the preparation of the two
sets of alloys. The earlier alloys were quenched directly after forging whereas
the present alloys were heat treated for a further 24 hours at 1000°C before
quenching. These differences can therefore be attributed to the stresses set
up in forging.

The effect of cold drawing on the magnetic properties of alloys which 
had been heat treated for periods of 2, 5 and 12 hours at 650°c after quench- 
ing was then studied and a summary of the observed effects is given in 
table 3. Cold drawing of quenched alloys produced only small changes in 
the magnetic properties, particularly in so far as no anisotropy developed. 

The results of magnetic measurements on 0.5%, 1% and 2% Fe-Cu
alloys are shown in tables 3 (a), (b) and (c). After heat treatments of 2, 5
or 12 hours at 650°C it was found that even small reductions in
cross-sectional area, e.g. 20%, produced substantial increases in both the
remanence and the saturation intensity of these alloys. As seen in the
appropriate columns of table 3, the percentage increase in both σ_RP and
σ_s was greatest for the initial reductions and tended to a maximum after
about 75%. This shows that a considerable increase in the ferromagnetism
has occurred, due to a transformation into the ferromagnetic α form, by
cold work, of the γ iron originally precipitated by heat treatment. In
addition to this further precipitation, cold drawing of these alloys after
heat treatment resulted in a highly anisotropic state with the coercive
force parallel to the direction of drawing being very much greater than in
the perpendicular direction. As can be seen from the values of H_CP and
H_CN, although H_CP approaches the value of the field required to reduce
remanence to zero, H_RP, as the reduction is increased, very little effect on
the values of H_CN was observed.

Although H_CN was hardly affected by continued reduction, the effect
on H_RN was very similar to that on the value of H_RP. However, the
magnitude of the effect of cold drawing on H_R was much smaller than on
H_CP, suggesting that cold drawing only has a small effect on the initial
ferromagnetic precipitate and that the high coercive forces are due to
orientation of the new ferromagnetic material.

The ratio σ_R/σ_s, an indication of the amount of reversible magnetization
which may occur, is also shown in table 3. Before drawing, the value of
this ratio was low, e.g. ~0.1 to 0.4, indicating a large reversible component.
Drawing of the alloys was found in general to cause a progressive decrease
in the ratio σ_RN/σ_s, whereas the ratio σ_RP/σ_s increased, until, with 90%
reduction, a value approaching unity was produced. These results support
the evidence from the values of H_c and H_r that rotation of the precipitate
without much change in shape is taking place.



Table 3


(a) The Variation of Magnetic Properties of a 0.5% Iron-Copper Alloy with Cold Drawing after
5 hours Heat Treatment at 650°C

0.0045    0.0045
0.174     0.0483
0.270     0.0607
0.309     0.0652
0.326     0.0584
0.0562
0.0529


(b) The Variation of Magnetic Properties of a 1% Iron-Copper Alloy with Cold Drawing after
5 hours Heat Treatment at 650°C

% Red.  H_CP         H_RP  H_RP/H_CP  H_CN  H_RN  H_RN/H_CN  σ_RP    σ_RN    σ_s    σ_RP/σ_s  σ_RN/σ_s
 0      486           820  1.7        486    820  1.7        0.0495  0.0495  0.112  0.44      0.44
24      480           630  1.3        280    640  2.3        0.272   0.102   0.462  0.59      0.22
44      850          1000  1.2        550   1150  2.1        0.529   0.197   0.775  0.68      0.25
61      950          1250  1.3        460   1150  2.5        0.705   0.179   0.933  0.76      0.19
75      [illegible]  1300  1.2        430   1200  2.8        0.820   0.146   1.04   0.79      0.14
86      1265         1380  1.1        410   1390  3.4        0.910   0.152   1.06   0.86      0.14
94      1170         1260  1.1        300   1160  3.9        0.973   0.125   1.06   0.92      0.12


(c) The Variation of Magnetic Properties of a 2% Copper-Iron Alloy with Cold Drawing after
5 hours Heat Treatment

% Red.  H_CP  H_RP  H_RP/H_CP  H_CN  H_RN  H_RN/H_CN  σ_RP   σ_RN   σ_s   σ_RP/σ_s  σ_RN/σ_s
 0      210    830  4.5        210    830  4.5        0.063  0.063  1.26  0.048     0.048
24      520    670  1.3        250    740  3.0        0.905  0.307  2.88  0.314     0.120
44      740    910  1.2        306   1200  3.9        1.47   0.388  3.24  0.455     0.120
61      700    860  1.2        200    950  4.7        1.69   0.281  3.57  0.474     0.079
75      790   1000  1.3        365   1300  3.6        1.73   0.472  3.63  0.477     0.130
86      840   1100  1.3        310   1210  3.9        1.54   0.517  3.63  0.477     0.142
94      800   1070  1.3        310   1320  4.3        1.69   0.506  3.51  0.481     0.144


H_CP: Coercivity parallel to the direction of drawing.
H_CN: Coercivity perpendicular to the direction of drawing.
H_RP: Reverse field required to reduce remanence to zero parallel to drawing.
H_RN: Reverse field required to reduce remanence to zero perpendicular to drawing.
σ_R: Remanence.
σ_s: Saturation intensity.
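
The ratio columns of table 3 can be recomputed from the measured fields and intensities as a consistency check. The sketch below uses the legible rows of table 3 (b); the column assignment is the one implied by the legend, and the names are hypothetical. The 75% row is left out because its parallel coercivity cannot be read in the source.

```python
# Legible rows of table 3 (b):
# (% reduction, H_CP, H_RP, H_CN, H_RN in oersteds, sigma_RP, sigma_RN, sigma_s).
TABLE_3B = [
    (0, 486, 820, 486, 820, 0.0495, 0.0495, 0.112),
    (24, 480, 630, 280, 640, 0.272, 0.102, 0.462),
    (44, 850, 1000, 550, 1150, 0.529, 0.197, 0.775),
    (61, 950, 1250, 460, 1150, 0.705, 0.179, 0.933),
    (86, 1265, 1380, 410, 1390, 0.910, 0.152, 1.06),
    (94, 1170, 1260, 300, 1160, 0.973, 0.125, 1.06),
]


def derived_ratios(row):
    """Return the four derived ratios for one row of the table."""
    _, h_cp, h_rp, h_cn, h_rn, s_rp, s_rn, s_s = row
    return {
        "H_RP/H_CP": h_rp / h_cp,    # parallel to the drawing direction
        "H_RN/H_CN": h_rn / h_cn,    # perpendicular to the drawing direction
        "sigma_RP/sigma_s": s_rp / s_s,
        "sigma_RN/sigma_s": s_rn / s_s,
    }


if __name__ == "__main__":
    for row in TABLE_3B:
        ratios = derived_ratios(row)
        print(f"{row[0]:3d}%  "
              f"H_RP/H_CP={ratios['H_RP/H_CP']:.1f}  "
              f"H_RN/H_CN={ratios['H_RN/H_CN']:.1f}  "
              f"sRP/ss={ratios['sigma_RP/sigma_s']:.2f}  "
              f"sRN/ss={ratios['sigma_RN/sigma_s']:.2f}")
```

The recomputed values reproduce the tabulated ratios to the quoted precision and make the growing anisotropy easy to see: H_RN/H_CN rises from 1.7 to 3.9 with reduction while H_RP/H_CP falls towards 1.1.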




In general the contributions to the coercive force arising from all three
forms of anisotropy will be influenced by cold drawing. Crystal
anisotropy will be affected according to the structures of the two phases
concerned. Thus in the case of face-centred cubic structures the ⟨100⟩ and
⟨111⟩ axes, and in body-centred cubic materials the ⟨110⟩ axes, are
aligned parallel to the direction of drawing. The contributions from shape
anisotropy may be two-fold. There may be elongation of the particles
caused by the drawing process, and there may be orientation of the particles
without deformation. From the measurements of coercivity and
magnetization referred to above, it appears unlikely that there is much change
of shape, but the increase of coercive force in the direction of drawing (as
well as the decrease in the perpendicular direction) would both result from
the cold work aligning the long axes of the particles along the direction of
drawing. Although strain effects will be present, it does not seem likely
that they will play a major part in so far as directional properties are concerned.
Of the possible causes of the changes in magnetic properties, the evidence
points towards the following explanation. Before cold work, there is a
considerable amount of face-centred iron which is coherent with the
copper-rich matrix. During the drawing process, these copper-rich grains, and
probably those coherent with them, are turned into the direction of drawing
and at the same time transformed into body-centred cubic iron, so that
finally we have ferromagnetic grains most of which have been elongated
to a greater extent than prior to drawing. At the same time, most of these
have their long axes along the direction of drawing.


3.2. Copper-Cobalt Alloys


The variation with heat treatment of the magnetic properties of quenched
copper-rich copper-cobalt alloys containing 1% and 1½% by weight of
cobalt was investigated. The effect of cold drawing after heat treatment
was also studied. Some measurements were carried out on a 4%
cobalt-copper alloy.

By quenching the 1% and 1½% alloys in oil after 24 hours annealing at
1000°C all the cobalt was retained in solid solution and the alloys exhibited
paramagnetic behaviour with no signs of ferromagnetism. After annealing
at temperatures ranging from 350°C to 450°C, the ferromagnetic
characteristics of remanence and coercivity were developed. The variations of
magnetic properties for a 1% alloy are summarized in table 4. The 1½%
alloy behaved in a very similar manner.

Short periods of heat treatment were found to develop low coercive
forces of about 50 oe which increased to a maximum on continued heat
treatment. This maximum was dependent on the temperature of heat
treatment, e.g. for the 1% alloy the maximum coercive force for heat
treatment at 300°C was 145 oe, whereas at 380°C it was 65 oe. Although
the values of H_c were comparatively low, i.e. compared with the results
for iron, the values of H_r were very high, with values of up to 1300 oe being
observed. As seen in table 4 the values of the ratio H_r/H_c were very
high with values always greater than seven, and often greater than 10.




This suggests that single domain behaviour is involved in the high values of
H_r, and also that a considerable number of the precipitated regions were
magnetically soft, having coercivities less than 100 oe. Particles the size
of which is either less than (i.e. subdomain) or much greater than (i.e.
multidomain) single domain size constitute 'soft' regions. Such regions
will also be associated with the very low values of σ_R/σ_s, which indicate a
large proportion of reversible magnetization.


Table 4. The Variation of Magnetic Properties with Heat Treatment at
350°C of a 1% Copper-Cobalt Alloy


Although the general variation of the magnetic properties of these alloys 
with heat treatment was similar to that observed by Bate et al. (1955), 
the rate of precipitation and the maximum coercivities were different in 
the two cases. This difference may be accounted for by the fact that, in 
the earlier work, heat treatments were carried out on specimens imme- 
diately after cutting from the forged bar, whereas in the present work 
specimens were quenched after preparation to remove surface strain 
before the heat treatments. 

The results of cold drawing on a 1½% cobalt-copper alloy are shown in
table 5. Before drawing, the specimens were magnetically isotropic with
low values of H_c and high values of H_r. The drawing produces virtually
no change in either the coercivity or the value of the field required to
reduce the remanence to zero, irrespective of the direction concerned.
The values of the saturation intensity show that no further precipitation
occurred with successive cold work. Little magnetic anisotropy is
developed in this system, though there is certainly indication of greater
irreversibility in the drawing direction. Microscopic examination of
specimens which had been polished and etched showed that cold drawing
did not affect the shape of the smaller particles, whilst the larger particles
were elongated in the direction of drawing. Results for a 1%
Co-Cu alloy follow closely those for the 1½% alloy.

The results for the 4% copper-cobalt alloys were however somewhat
different from those for the 1% and 1½% alloys in that after quenching in
oil from 1000°C this alloy was ferromagnetic with a saturation
magnetization of σ_s ≈ 5 e.m.u./g. This is not unexpected since the solubility
limit only reaches 4% at about 1000°C. The effects of cold drawing on the




quenched alloy were investigated and the results observed are summarized 
in table 6. 

Cold drawing of the 4% alloy increased the low value of H_c observed
for the undrawn material by similar amounts in both the parallel and
perpendicular directions for reductions of up to 80%. However, on
further reduction in area H_CP was increased by a greater amount than


Table 5. The Effect of Cold Drawing on the Magnetic Properties of
a 1½% Cobalt-Copper Alloy after 46 hours Heat Treatment at 400°C

0.014     0.014
0.007     0.0077
0.0038    0.0031
0.0066    0.0029
0.0072    0.0031
0.0080    0.0028


Table 6. The Effect of Cold Drawing on the Magnetic Properties of
a 4% Copper-Cobalt Alloy after Quenching in Oil


H_CN; the actual values of H_CP and H_CN after 96% reduction were 310 oe
and 150 oe respectively. The drawn material was almost isotropic with
respect to the values of H_r, and apart from the values observed for material
which had received 96% reduction in area, the values of H_r/H_c remained
almost constant. Although no appreciable effect on either the saturation
intensity or the values of σ_RN was observed, the value of σ_RP was
increased by a factor of 10 by 96% reduction. This gave a corresponding
increase in the value of σ_RP/σ_s, but even after 96% reduction the value was
only 0.30.




For such small amounts of cobalt as were involved in the alloys studied
here it is not possible to obtain an x-ray determination of the structure
of the precipitated regions. Although the evidence is in favour of the
structure of the precipitated regions being face-centred cubic, the
possibility of it being hexagonal does exist. The two most interesting features
of the magnetic properties of the alloys were the high values of H_r/H_c and
the low values of σ_R/σ_s. Considering the former, while any one of the
three forms of anisotropy may give contributions sufficiently high to
explain the low H_c values, it is not easy to explain the values of H_r.
However, these high values may be explained by assuming that a small
amount of the precipitate consists of single domain hexagonal cobalt
regions from which quite considerable contributions to the H_r values may
arise from crystal anisotropy. Since the hexagonal basal plane tends to
lie along the drawing direction for many hexagonal metals, this will tend
to enhance the contribution to the coercivity from this source.
Furthermore, as the coercive force of a random assembly of single domain cobalt
particles having an axial ratio of only 1.2 may be as high as 600 oe, a
certain contribution is almost inevitably due to shape. If the remainder
of the precipitate is magnetically soft, this might account for the low
values of σ_R/σ_s involved in determining the values of H_r. The fact that
the remanence along the direction of drawing is appreciably higher than
the corresponding value in a perpendicular direction points to change in
the shape factor caused by drawing. Owing to the low disregistry of the
two precipitates, it would appear likely that elongation of the particles
might be a more important factor than for the case of the copper-iron
system.


3.3. Gold-Nickel Alloys


Investigations of the variation of the magnetic properties of gold-rich 
gold-nickel alloys with heat treatment and also with subsequent cold 
drawing were carried out on alloys containing 6% and 9% by weight of 
nickel. 

In early experiments, heat treatments were carried out on specimens 
immediately after cutting from the quenched forged bar. Before heat 
treatment these specimens were not ferromagnetic, but became so during 
heat treatment. Later, as for copper—cobalt alloys, all specimens were 
quenched from 1000°c in water before heat treatment and all subsequent 
heat treatments were carried out on such quenched specimens. 

The effect of heat treatment at temperatures ranging from 300°C to
400°C was studied for both alloys and the results obtained by heat
treatment at 350°C of the 9% alloy are summarized in table 7.

After short periods of heat treatment, the materials, which were then
magnetically isotropic, showed coercive forces of about 200 oe. With
further heat treatment this value increased to a maximum of about 300 oe
and then decreased. The values of H_r showed similar variations to those
of H_c, and although H_r was greater than H_c by 100-150 oe, the difference




between the two was not as large as observed for other dilute
ferromagnetic alloys. The value of H_r/H_c was only slightly affected by heat
treatment, decreasing from 1.9 down to 1.2 and then remaining constant.
For the 6% alloy H_r/H_c was constant, having a value of 1.3.


Table 7. The Variation of Magnetic Properties with Time of Heat
Treatment at 350°C of a 9% Gold-Nickel Alloy


Time (hr)   H_c    H_r    H_r/H_c   σ_R      σ_s     σ_R/σ_s
    2       210    400    1.90      0.0004   0.010   0.040
    4       235    410    1.75      0.0010   0.012   0.083
    8       275    410    1.50      0.0025   0.014   0.180
   16       285    385    1.30      0.0263   0.068   0.390
   32       295    360    1.20      0.1180   0.285   0.410
   64       230    280    1.20      1.350    2.900   0.470


Table 8. The Effect of Cold Drawing on the Magnetic Properties of
a 9% Gold-Nickel Alloy after 30 hours Heat Treatment at 350°C


Diam. (mm)   % Red.  H_CP  H_RP  H_RP/H_CP  H_CN  H_RN  H_RN/H_CN  σ_RP   σ_RN   σ_s    σ_RP/σ_s  σ_RN/σ_s
8             0      305   410   1.3        305   410   1.3        0.186  0.186  0.440  0.42      0.42
[illegible]  24      490   890   1.8        645   735   1.1        0.121  0.279  0.520  0.23      0.54
6.5          34      450   800   1.8        605   690   1.1        0.108  0.208  0.400  0.27      0.52
6.0          44      430   685   1.6        570   675   1.2        0.156  0.256  0.490  0.32      0.52
5.5          53      385   585   1.5        510   605   1.2        0.212  0.226  0.550  0.39      0.49
4.5          69      365   570   1.5        515   630   1.2        0.182  0.307  0.610  0.30      0.50
4.0          75      305   444   1.4        410   540   1.3        0.242  0.236  0.570  0.42      0.41
3.0          86      295   485   1.6        450   575   1.3        0.142  0.202  0.450  0.32      0.45
2.1          93      300   400   1.3        350   500   1.4        0.257  0.190  0.530  0.48      0.36


Although microscopic examination of specimens after polishing and
etching showed the particles to be spherical with sizes varying from well
below to well above the critical single domain size, a very small deviation
from spherical shape is sufficient to give coercivities ~100 oe. Due to this
low value, contributions to the coercivity of these alloys may arise from
any of the different sources of anisotropy. The value of 1.3 for the ratio
H_r/H_c is not significant, since very small amounts of the low coercivity
regions are required to give this value for H_r/H_c (Meiklejohn 1953).

As seen in table 7 the values of σ_R/σ_s increase from a very low value
after short periods of heat treatment to values approaching 0.5 after



further heat treatment. This is in agreement with the value of σ_R/σ_s = 0.5
predicted by Stoner and Wohlfarth (1948) for a random assembly of single
domain particles.
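
The value 0.5 can be recovered with a short numerical check. It is assumed here that each particle is uniaxial and that, once the saturating field is removed, its moment lies along the easy-axis direction nearest the field; the remanence ratio is then the average of cos θ over a hemisphere of easy-axis directions. The function name and step count are illustrative.

```python
import math


def random_assembly_remanence(steps=100_000):
    """Midpoint-rule estimate of the mean of cos(theta) over a hemisphere.

    Easy axes are uniformly distributed in direction, so the weight of the
    polar angle theta (measured from the field direction) is sin(theta).
    The weight integrates to 1 over 0..pi/2, so no further normalization
    is needed.
    """
    d_theta = (math.pi / 2.0) / steps
    total = 0.0
    for i in range(steps):
        theta = (i + 0.5) * d_theta
        total += math.cos(theta) * math.sin(theta) * d_theta
    return total


if __name__ == "__main__":
    print(f"sigma_R / sigma_s = {random_assembly_remanence():.4f}")
```

The integral can also be done by hand, since the integrand is half of sin 2θ, which gives exactly one half.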

The results for the 6% alloy were very similar to those for the 9% alloy. 
The maximum coercivities were obtained at temperatures of 350°C and
380°C for the 9% and 6% alloys respectively.

After the alloys had been heat treated to give the maximum coercive 
force, the effect of cold drawing on their magnetic properties was studied. 
The results obtained are summarized in table 8. 

Before cold drawing, these alloys were isotropic with values of $H_c$ and
$H_r$ of ~300 Oe and 400 Oe respectively. Cold drawing of the alloys
resulted in anisotropic material with increased values of $H_c$ and $H_r$ both
parallel and perpendicular to the direction of drawing. The values of $H_c$
and $H_r$ both increased to a maximum after 25% reduction and decreased on
further reduction. As seen in the respective columns of table 8, $H_{cN}$
was always greater than the corresponding value of $H_{cP}$, the difference at
their peak values being about 150 Oe. The drawing was found initially to
increase the value of $H_{rP}/H_{cP}$ whilst that of $H_{rN}/H_{cN}$ was decreased,
these values remaining reasonably constant during the reduction. As
has been pointed out above, both the ⟨100⟩ and ⟨111⟩ directions tend to
align along the direction of drawing for face-centred cubic material, the
proportions following one particular direction varying from one metal to
another to that given above. The effect of drawing will therefore tend
to change from a random distribution. If the proportions are of the
same order of magnitude then the new distribution will show no particular
preponderance for any crystal axis to lie either parallel or perpendicular
to the drawing direction, i.e. approximate isotropy.
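
The arithmetic behind the columns of table 8 can be sketched as follows. It is an assumption here that the reduction column is the percentage reduction in cross-sectional area of a round wire, 1 - (d/d0)^2, with the undrawn diameter d0 = 8 in the units of the table; the helper names are illustrative and do not come from the paper.

```python
def area_reduction_percent(d0: float, d: float) -> float:
    """Percentage reduction in cross-sectional area when a wire of diameter d0
    is drawn down to diameter d (assumes a circular section)."""
    return 100.0 * (1.0 - (d / d0) ** 2)

def reduced_remanence(sigma_r: float, sigma_s: float) -> float:
    """Ratio of remanent to saturation magnetization."""
    return sigma_r / sigma_s

if __name__ == "__main__":
    # Diameters taken from table 8, with the undrawn wire assumed to be 8.
    for d in (6.5, 6.0, 4.0, 3.0):
        print(d, round(area_reduction_percent(8.0, d)))
    # Remanence ratio for the row at 24% reduction.
    print(round(reduced_remanence(0.279, 0.520), 2))
```

Most tabulated reductions agree with this reading to the nearest per cent; small differences in individual rows may reflect rounding of the quoted diameters.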

The results for the 6% alloy were very similar to those observed for the
9% alloy. 

As the preferred orientation assumed by materials during cold drawing
is a gradual process which may not be completed until the material has
been heavily reduced in area, it appears that the initial large increases in
the values of $H_c$ and $H_r$ are not associated with this type of process.
Considering the values of coercive force associated with the various forms
of anisotropy as shown in table 1, it appears that strain may play an
important role in these initial increases. After a certain amount of
reduction, further drawing will destroy any coherency between the
precipitate and matrix. This process will probably be accompanied by a
reduction in strain, and hence coercive force. The observed reduction in
the values of $H_c$ and $H_r$ with reductions greater than 25% may be
associated with the destruction of coherency and the orientation of the
precipitate according to the double drawing texture of face-centred cubic
material. This view is supported by the facts that the decrease was
gradual and the materials became gradually more isotropic as the
reduction in area progressed.




§ 4. Discussion 


The variation of the magnetic properties with heat treatment was very 
similar for the three alloy systems studied and may be explained in terms 
of the growth of small ferromagnetic regions in a non magnetic matrix. 
The maximum coercive forces observed for these three alloy systems
suggest that the higher coercive forces are associated with large
disregistry. The values of $H_{c\,\max}$ and the corresponding disregistry values
are given in table 9.

Table 9 


$H_{c\,\max}$ | Disregistry
480 Oe | 25%
150 Oe | 15%
325 Oe | 10%


As discussed by Geisler (1953) large disregistry leads to a tetragonal 
structure as opposed to a simple cubic structure for coherent growth of 
the precipitate. In these alloys this gives rise to large internal strains 
and larger values of the crystal anisotropy constants than observed for 
simple cubic material. This means larger contributions to the coercive 
force from the relative anisotropies as shown in table 1. 

Cold drawing of these alloys after heat treatment resulted in materials 
which were magnetically anisotropic, the anisotropy being more pro- 
nounced in the copper-—iron alloys, where H,,,>H,. than in the other two 
systems studied. It is interesting to note that for materials involving 
precipitated regions of face-centred cubic structure the anisotropy pro- 
duced by cold drawing was much less than for the one involving regions of 
body-centred cubic structure. Considering this and the fact that apart 
from the 4% copper—cobalt alloy little preferred shape orientation, or 
shape deformation was observed, it may be that the anisotropy produced 
was associated with crystal anisotropy and the drawing texture of the 
materials involved. Only in the case of the copper-iron alloys did cold 
drawing cause further precipitation of the ferromagnetic region. This 
was due to the transformation of the non-magnetic γ iron to the
ferromagnetic α iron, which is body-centred cubic.

Preliminary microscopic examinations were carried out on all three 
alloy systems. For the iron—copper alloys, the electron microscope photo- 
graphs did not indicate any marked anisotropy, but the average diameter 
of the particles at maximum coercivity was about ten times larger than 
the generally accepted value for isolated single domain particles of iron. 
For the cobalt—copper alloys it appeared that cold drawing had little effect 
on the shape of the smaller particles, but larger ones—again considerably 
bigger than single domain size—were elongated in the direction of drawing.
In the case of the nickel-gold system, microscopic examination proved
more difficult on account of the extreme softness of the matrix, but optical
microscopy revealed that, as in the case of the cobalt-copper alloys, the
larger particles were elongated in the drawing direction. The most important
generalization to be drawn from these preliminary experiments is
that the average particle size is much larger than would have been expected
from single domain considerations. This and other aspects of the results
are being further investigated.

Although it is true to say that contributions to the coercive force of these 
alloys, both drawn and undrawn, will arise from each of the three forms of 
anisotropy, it is not yet possible to make a satisfactory estimation of the 
relative contributions from each. However, further advance towards 
such an estimation may be made from further work, such as study of the 
temperature dependence of the magnetic properties of these alloys. 


ACKNOWLEDGMENTS 


We are grateful to Dr. E. O. Hall and Professor R. W. K. Honeycombe 
for their help with the preparation and examination of specimens by means
of the optical and electron microscopes.


REFERENCES 


BATE, G., SCHOFIELD, D., and SUCKSMITH, W., 1955, Phil. Mag., 46, 621.
BERTAUT, F., 1953, C. R. Acad. Sci., Paris, 229, 417.
GEISLER, A. H., 1953, Rev. mod. Phys., 25, 316.
KITTEL, C., 1949, Rev. mod. Phys., 21, 541.
MEIKLEJOHN, W. H., 1953, Rev. mod. Phys., 25, 302.
NÉEL, L., 1947 a, C. R. Acad. Sci., Paris, 224, 1488; 1947 b, Ibid., 224, 1550.
STONER, E. C., and WOHLFARTH, E. P., 1948, Phil. Trans. roy. Soc. A, 240,
SUCKSMITH, W., 1929, Phil. Mag., 8, 158.




A Note on Transition Metal Alloys†


By C. W. Haworth and W. Hume-Rothery
Department of Metallurgy, University Museum, Parks Road, Oxford


[Received May 14, 1958]


ABSTRACT 


The composition limits of the different crystal structures found in the 
alloys of Transition Elements of Groups VA to VIIIC are discussed. The
alloys of these elements give rise to phases with crystal structures of the σ,
c.p. hexagonal, αMn, and βW types. The composition limits of these are
often related to the Group Numbers of the constituent elements. If the
equilibrium diagrams are drawn with the element of lower Group Number
on the left, passage across the diagram from left to right results in the
occurrence of different phases in the following order:


b.c. cube → βW → σ → αMn → c.p. hex. → f.c. cube.


In all systems one or more of the above phases is absent, but the 
characteristic order is retained. In the alloys of elements of Group VIIIC 
with the elements in the middle of the transition series, there is a general 
tendency for the primary solid solution in the element of Group VIIIC to 
be greater than that in the element of the earlier Group, and an explanation 
of this is advanced. 


§ 1. TYPES OF STRUCTURE


THE object of the present note is to discuss the composition limits of the 
different crystal structures found in the alloys of the Transition Elements 
of Groups VA-VIIIC. The electron theory of these metals is still a 
matter of acute controversy, and only a few of the binary equilibrium 
diagrams are accurately known. However, when the existing diagrams 
are compared systematically, some general principles or tendencies are
now apparent. 

For convenience we shall describe these first in terms of the Group 
Number of the elements, and regard this as increasing from IV (Ti, Zr, Hf) 
to VIIIC (Ni, Pd, Pt). All diagrams are drawn with the Group Number 
increasing from left to right. In these diagrams, on passing from left 
to right, the number of electrons outside the inert gas shell increases, 
and the number of holes in the d shell decreases. If a viewpoint of the 
Pauling type is adopted, the number of vacancies in the atomic orbitals 
decreases on passing from Group VI to Group VIIIC; there are no atomic 
orbitals in the elements of Groups IV or V. 

In the Second and Third Long Periods the crystal structures of the 
pure metals show a clear change with increasing Group Number from 


b.c. cube (Groups V, VI) → c.p. hex. (Groups VII, VIIIA) →
f.c. cube (Groups VIIIB and C).


† Communicated by the Authors.




In the First Short Period, the b.c. cubic structure is again found in Groups 
V and VI, but continues into Groups VII (Mn) and VIII (δ, αFe). The
f.c. cubic structure is again found in Groups VIIIB and C, but extends
backwards into Groups VIIIA (γFe) and VII (γMn). The c.p. hexagonal
structure is found in Group VIIIB (Co), but it is probable that this is not
strictly comparable with the c.p. hexagonal structures of Ru and Os†.

That Group Numbers VI and VII represent a critical stage in the 
transition process is suggested by the existence of the numerous 
allotropes of Mn, and by the formation of characteristic σ structures in
alloys of which one metal is of Group Number lower than VII, and the
other is of Group Number VII or higher. It is to be noted that the σ
structure is also formed by the pure metal βU in Group VI, although here
the interpretation is less clear, because of the actinide process.

Recent experimental work has shown that, apart from the σ structures,
these alloys may give rise to phases with (a) c.p. hexagonal, (b) α-Mn and
(c) β-W types of structure, and that the composition limits of these are
often clearly related to the Group Numbers of the constituent elements.
The present examination suggests that on proceeding from left to right
(i.e. with increasing average Group Number) across the diagram, the
different phases occur in the following order:


b.c. cube → βW → σ → αMn → c.p. hex. → f.c. cube. . . . (1)


In all systems one or more of the above phases are absent, but the 
characteristic order is retained. The general tendencies are seen most 
clearly by considering the alloys of elements of Groups V and VI in 
separate diagrams. 


1.1. Alloys of Elements of Group VI 


Figure 1 indicates the approximate composition limits of the different 
phases in the alloys of Cr, Mo and W with the sequences of elements 
(Mn, Fe, Co, Ni), (Tc, Ru, Rh, Pd), and (Re, Os, Ir, Pt). 

In the alloys of Cr, the σ-phase fields show the characteristic shift from
right to left on passing from Mn→Fe→Co, and from Re→Os‡. The
c.p. hexagonal or ε-phases move in the same direction on passing from
Ru→Rh. The β-W phases are at approximately the same composition in
the alloys with Ru, Rh, Ir and Pt. In all systems the order of the phases
is that of relation (1).

In the alloys of Mo, the σ-phases show the same characteristic shift from
right to left in the sequences Mn→Fe→Co, and Re→Os→Ir. The ε-phases
show the same shift from right to left in the sequences Ru→Rh→Pd, and
Re→Os→Ir, although there is little change in the mean composition of
the ε-phase on passing from Ir→Pt. In all cases the order of the phases
is that of relation (1).




† The axial ratios for Ru and Os are appreciably smaller than that for c.p.
spheres, in contrast to the value for Co.

‡ This has been pointed out by previous authors.




In the alloys of W, the σ-phases show the characteristic shift from right
to left in the sequences Fe→Co, and Re→Os→Ir. The ε-phases show the
same shift in the sequences Ru→Rh, and Os→Ir. The order of the phases
is always that of relation (1).


1.2. Alloys of Elements of Group V


Figure 2 shows the approximate composition limits of the different 
phases in alloys of V, Nb, and Ta with the same elements as those in fig. 1. 
In all cases the order of the phases is that of relation (1). 

In the alloys of V, the σ-phases show the characteristic shift in the
sequence Mn→Fe→Co→Ni, and the ε-phases show a shift in the same
direction in the sequence Ru→Rh. In the next Period, however, the
solid solution of V in the c.p. hexagonal Re is of wide extent, and on
passing from Re→Os the mean composition of the ε-phase field may move
from left to right.

In the alloys of Nb, the σ-phases show the characteristic shift in the
sequence Re→Os→Ir, although there is little change in the compositions
of the σ-phases in the sequences Rh→Pd and Ir→Pt. The α-Mn phases
show a shift in the same direction in the sequence Re→Os, although the
mean composition of the primary ε solid solutions may change very slightly
in the reverse direction.


The alloys of Ta are very incompletely known, but the σ-phase fields
show the usual shift on passing from (Re, Os)→(Ir, Pt), although there is
little change on passing from Re→Os or from Ir→Pt.




From the above survey and from the data of figs. 1 and 2, the following 
general conclusions may be drawn : 


(a) The compositions of the βW phases are always very near to the
ratio $A_3B$, where A is the element from Group V or VI, and B is from
Group VIII.


(b) With very few exceptions, the compositions of the σ, αMn, and
ε-phases, in the alloys of any one metal in Groups V or VI, show a general
shift from right to left as the Group Number of the second metal increases
from Group VII→VIIIA→VIIIB→VIIIC.


(c) The order of the phases given in relation (1) above is always obeyed. 


§ 2. ELECTRON CONCENTRATION EFFECTS 


The changes in composition referred to in (b) above are in the direction 
which would be expected if each of the different structures were favoured 
by a characteristic electron : atom ratio, where the number of electrons 
is that outside the inert gas shell. Attempts to correlate the compositions 
of the o-phases in this way have been made by several authors, but a 
detailed examination shows that, although a general correspondence does 
exist in many cases, it is not possible to account for all the composition 
ranges in terms of a single electron: atom ratio. 

The vacancies in the atomic orbitals postulated by Pauling diminish 
regularly on passing from Group VI to Group VIIIC, and the correlation 
in terms of electron : atom ratios is thus accompanied by a correlation 
in terms of Pauling vacancies, but again the facts cannot all be covered 
by a simple assumption. 

A detailed examination suggests that the composition ranges, not only
of the σ-phases, but of all the phases in relation (1) do show a rough
correlation with electron : atom ratios, and that this is shown most clearly
by considering the Group V and Group VI elements separately. The
following table refers to alloys of these elements with those of Group VII 
and VIII. These values must be looked upon as approximate only, 
because many of the equilibrium diagrams have not been determined 
accurately. 


Table 1 


Type of structure | Group V metals | Group VI metals
b.c. cubic | 5 to 5.2 | 6 to 6.2
βW | 5.8 to 6.3 | 6.4 to 7
σ | 6 to 7.4 | 6.5 to 7.6
αMn | 6.5 to 7.2 | 6.7 to 7
c.p. hex. | 6.2 to 8 | 7 to 8
f.c. cubic | 7.5 to 10 | 7.5 to 10
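
The electron : atom ratio of a given composition can be worked out as below and compared with the ranges in table 1. This is a minimal sketch: it assumes the number of electrons outside the inert gas shell equals the Group Number, with Groups VIIIA, VIIIB and VIIIC counted as 8, 9 and 10, and the dictionary and function names are illustrative.

```python
# Electrons outside the inert gas shell, taken equal to the Group Number
# (assumption for this sketch: VIIIA = 8, VIIIB = 9, VIIIC = 10).
GROUP_ELECTRONS = {"V": 5, "VI": 6, "VII": 7, "VIIIA": 8, "VIIIB": 9, "VIIIC": 10}

def electron_atom_ratio(composition: dict[str, float]) -> float:
    """Average number of outer electrons per atom.

    `composition` maps a Group label to its relative atomic amount; the
    amounts are normalised, so {"VI": 3, "VIIIC": 1} describes an A3B alloy.
    """
    total = sum(composition.values())
    if total <= 0:
        raise ValueError("composition must contain a positive amount of material")
    return sum(GROUP_ELECTRONS[g] * x for g, x in composition.items()) / total

if __name__ == "__main__":
    # A3B with A from Group VI and B from Group VIIIC.
    print(electron_atom_ratio({"VI": 3, "VIIIC": 1}))
```

For an alloy of fixed structure type, replacing the second metal by one of higher Group Number raises the ratio at a given composition, so a phase tied to a roughly constant ratio must move towards the first metal, which is the right-to-left shift noted in conclusion (b).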


§ 3. THE MUTUAL SOLUBILITIES OF TRANSITION METALS IN THE SOLID
STATE


Recent work by the authors on the constitution of molybdenum alloys 
has shown the maximum solubility of molybdenum in palladium to be 
much greater than that of palladium in molybdenum. Examination of
the existing data suggests that this is a general characteristic of the alloys
of Group VIIIC with the elements in the middle of the transition series.


Table 2 


System | Solubility in atomic % in element of Group VIIIC | Solubility of Group VIIIC element in second metal
Ni-Cr | 46 | 33
Ni-Ti | 13.8 | 10.3
Ni-Mo | ca. 25 | small
Ni-Nb | 10 | <7
Ni-W | 17.5 | ca. 4
Pd-Cr | 60 | b-
Pd-Mo | ca. 40 | ca. 5
Pd-W | ca. 20 | probably small†
Pt-Cr | 75 or 62‡ | small


† The lattice spacings of the W-rich phase in 2-phase alloys were
indistinguishable from that of pure W.

‡ The value 75 at.% Cr is from the diagram in Vol. 1 of Smithells'
Reference Book. The alternative value of 62% is from Raub and Mahler (1955).


Table 2 shows the data for the alloys of elements of Group VIIIC with 
those of Groups IVA, VA, and VIA, and the general tendency is clear, 
although the difference in solubilities is less where both elements are in 
the First Short Period. This tendency would not be expected from the 
Pauling scheme which allots the same valency of 5.78 to all the metals
from Groups VIA to VIIIC. It would, however, agree with the views
of Hume-Rothery et al. (1951) and of Altmann et al. (1957) according to
which the valency (in the sense of the number of bonding electrons) 
diminishes on passing through Group VIII, and the bonding orbitals 
involve the greatest proportion of d function in Group VI. If in Group 
VIIIC some of the d electrons have entered atomic and non-bonding 
orbitals, it may not be possible for these atoms to give rise to the type of 
bonding required for the crystal structure of the Group VI element, 
although there will be no corresponding restriction to prevent the atom 
of the Group VI element from entering the lattice of an element of 
Group VIIIC. In this way the marked difference between the extents of 
the primary solid solutions at the two sides of the equilibrium diagram 
can be understood, and it should be emphasized that the difference is 
shown in cases (e.g. Mo-Pd) where the size factor is extremely favourable. 




§ 4. SOURCES OF DATA

Most of the data on the formation of intermediate phases has been
taken from a paper by Greenfield and Beck (1956), and a more recent
report from the A.E.I. Laboratories by A. G. Knapton (1957). Reference
has also been made to the work of Raub (1954) and Raub and Walter
(1951), and to the Metals Reference Book by C. J. Smithells. The data
on Mo-Pd is from unpublished work in Oxford by C. W. Haworth and
W. Hume-Rothery.


REFERENCES 

ALTMANN, S. L., COULSON, C. A., and HUME-ROTHERY, W., 1957, Proc. roy.
Soc. A, 240, 145.
GREENFIELD, P., and BECK, P. A., 1956, J. Metals, 8, 265.
HUME-ROTHERY, W., IRVING, H. M., and WILLIAMS, R. J. P., 1951, Proc. roy.
Soc. A, 208, 431.
KNAPTON, A. G., 1957, A.E.I. Report, No. A. 682.
RAUB, E., 1954, Z. Metallk., 45, 23.
RAUB, E., and MAHLER, W., 1955, Z. Metallk., 46, 210.
RAUB, E., and WALTER, P., 1951, W.C. Heraeus Festschrift.




A New Method for the Evaluation of Electric Conductivity 
in Metals†


By S. F. Edwards
Department of Mathematical Physics, University of Birmingham 


[Received May 23, 1958] 


ABSTRACT 
A method is developed which allows the evaluation of the closed formal 
expressions for electrical conductivity which have recently been developed 
by several authors. The case of a random set of scatterers is treated in 
detail and the formal solution made to yield directly the solution to the 
Boltzmann equation. A brief mention of the application of this method to 
liquids and alloys is made. 


§ 1. INTRODUCTION


RECENTLY it has been realized by several workers (Nakano 1956, Kubo 
1956, Kohn and Luttinger 1957, Greenwood 1958), that the electric 
conductivity in, say, a metal can be written down in a closed formal 
expression, without going through the intermediate form of deriving a 
transport equation, and moreover these closed forms are exact. The 
usual derivation of a transport equation (cf. Peierls 1955) is rather limited 
in its applicability and cannot in any simple way be extended to the cases 
of alloys and liquids etc., and moreover, even where it is usually used, it 
is not at all clear (see e.g. Peierls 1955, p. 123) that there are not temperature 
dependent corrections which would entirely invalidate the usual solution. 
Now the formal exact solutions avoid all this, but carry the difficulty 
that they are still in a rather abstract form, and it is not clear how they 
are to be evaluated. This paper is concerned with the evaluation of 
these formulae, and will show that they can readily give the same result 
as the usual transport equation where the latter has been assumed to be 
correct, and thus dispose of the possibility of temperature dependent 
corrections. The use of the exact formulation in new problems will only
be very briefly mentioned in this paper, and since the present object is 
only to illustrate the method, the simplest problem, that of the conductivity 
of electrons scattered by a random set of scattering centres, will be 
discussed. 
§ 2. FORMULATION OF THE PROBLEM 


The starting point will be the formula of Greenwood and Peierls, which 
states that the conductivity tensor is given by 


Sy — Bnet S Pan yy"8( By — By) i 


† Communicated by Professor R. E. Peierls, C.B.E., F.R.S.




where $v^{\mu}_{nm}$ is the matrix element of velocity, f the electron distribution
function, and the δ function is to be understood in the sense that the
limit $E_n \to E_m$ is taken after the system is considered so large that always
there are many levels between $E_n$ and $E_m$. This form as it stands is not
suitable for computation, so it is rearranged by first writing it out in full:


he Op" (x) 43 On (Y) 45 
q 7 ees ; ; n Y. 3 9 
Cant” = — | Volt) Se a | Yonly) Ha ary (2) 
Introduce units so that (2m/h?)—/?=1, then 


Brre2 
» Cnn inn = roa e = )z ie 0x, 7 {y,(% abn (x 7 <a Pin yy 
x 6(xv’ — y)d(y' — x) Be dy dex’ d®y’. Sige Pay KS 


Now $\sum_n \psi_n(x)\psi_n^*(x')\,\delta(E - E_n)$ is a Green function, the solution of the
homogeneous wave equation. If the Schrödinger equation for $\psi$ is

$(H - E)\psi = 0$ . . . (4)

and the Green functions $G_+$, $G_-$ are given by

$(H - E + i\epsilon)\,G_+(x, x') = \delta(x - x')$ . . . (5)
$(H - E - i\epsilon)\,G_-(x, x') = \delta(x - x')$ . . . (6)

where ε is an infinitesimal quantity used to define the contour defining
the $G_+$, $G_-$, then


Glo!) = Bale Bia) a Oe 4) eee eae ne ee lae (7) 


From these the sum and the difference can be made 


GG = 2rri Sp, (x X)py* (x o(f— #,,) 


= 27iG e . 6 ° . . . 5 2 . Fy 6 (8) 
Gs 4 Ge ma 27iP Yb, (% Jb (% )(H— iL Ne 
= 27iGp re ee et 8) 


P standing for principal part. . 
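
The step from the two Green functions to a δ function rests on the identity $1/(x + i\epsilon) - 1/(x - i\epsilon) = -2i\epsilon/(x^2 + \epsilon^2)$: the difference is 2πi times a Lorentzian of unit area and width ε, up to the overall sign fixed by the convention chosen for ±iε. The sketch below checks numerically that this Lorentzian keeps unit area as ε becomes small; the function name, the integration window and the midpoint rule are illustrative assumptions.

```python
import math

def lorentzian_area(eps: float, half_width: float, n_steps: int = 200_000) -> float:
    """Midpoint-rule integral of eps / (pi * (x**2 + eps**2))
    over the interval [-half_width, half_width]."""
    h = 2.0 * half_width / n_steps
    total = 0.0
    for i in range(n_steps):
        x = -half_width + (i + 0.5) * h  # midpoint of the i-th interval
        total += eps / (math.pi * (x * x + eps * eps)) * h
    return total

if __name__ == "__main__":
    # The area tends to 1 as eps becomes small compared with the window.
    for eps in (1.0, 0.1, 0.01):
        print(eps, lorentzian_area(eps, 10.0))
```

The exact area over a finite window is (2/π) arctan(half_width/ε), which approaches 1 as ε tends to zero, so the weight collects at x = 0 as a δ function requires.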
In the absence of potentials these functions are just 


(4rr)—! eV, (4rr)-te- tv), 1(47?r)— sin +/ (Lr), 
1(47?r)— cos 4/ (E71), (20 Xl) ee ee vice!) 


assuming that one already is dealing with an infinite system, i.e., a
continuum of energy so that the sums over n become integrals. So if G
is used for the difference of $G_+$ and $G_-$, (8), the sine-like function, and also
we now specialize to the case of a diagonal σ, we have
a / 
o= 5A) S [fff bate 2'/8x,)(O6m v9’) 
n,m 
x 6(E,, —E,,)(Of/OE ,)8(x' — y)A(y’ — #) d3x d3y dx’ dy’ . (11) 
where $G_n(x, x')$ is $G(E_n; x, x')$. The problem now is to find G and f in the
presence of the scattering potential, and finally to average σ over all
configurations of this potential†. It is convenient to express this in the
following way: the Schrödinger equation is now
(V2—EH+tet+ V(x))G, (x, x’) =8(4—-2'), ~) tp eee 


where 
V (x)= > U(x=Xy™ 2 A 


X,, being the positions of the scattering centres, and u the potential they 
exert on the electron. It is convenient to use the Fourier transform of 
this potential, defining 


ey Oe 


a 


V(x)= | u(k) ep, dk. ao ee 


Now if the X, are random, it can be shown by standard probability theory 
that the distribution function for the p’s is 


P(n,)=Eexp| — [| Ble idpan, Pee) 


. {) O(K, jb, 1) P,P) Pm Pk d2j dL 3m — .. j 
R(k, j) =N-18(k + j) +O(N-) 


noe 1 ? ; ene 
Uk jo toem)= NN (55) 80-4] +14 m) — Sa j)BU+ mm) ] ON) 
perm J 
(16) 
where N is the total number of scatterers, and é the normalization to give 
total probability unity. When N is large, this can be used with k running 
over the whole continuum of k space. So we reach the final formula 


1 /87e? 


= 3 ; J ae 4) EP(py)(OG (a, &')/ OX, (OG nly, y’)/OY,) 


x 8(B, — By) f/2E,,)8(« — y')3(y — 2") dx dy dx’ dy’ Tp, dp,*. (17) 
This form has the great advantage that it is essentially the same form as 
that of electrons interacting with the quantized electromagnetic field, 
and so techniques for evaluating it are already in existence. Moreover, 
there are none of the divergence problems of electrodynamics here and the 


various approximate techniques of electrodynamics can be applied with 
confidence. 


† The meaning of the averaging is discussed in detail by Kohn and
Luttinger (1957).




§ 3. EVALUATION 

The essential difference between (1) and (17) is that the averaging over 
the scatterers can be carried out before the integrations over coordinates 
and allows manipulations which are meaningless when applied to (1). 
Although one can, on the basis of (17), derive integral equations for the 
average of G(x, x’), G(y,y’), it is simplest to consider the perturbation 
expansion of the G’s, from which the structure of the integral will become 
clear. However, one should emphasize that there is no need to approach 
the evaluation by a perturbation approach and the results to be obtained 
below can be got directly. 

Consider firstly a simpler problem, that of obtaining the average of
just one G alone. This is the difference of the averages of $G_+$ and $G_-$,
which are more convenient to consider. Now in perturbation theory one
can write, using $G^{(0)}$ for the ρ-independent functions (10),


ee, = G OG, a) — | GO(x, y)u(k) ep, *G_OY, x’) dy Bk 


+ | | | | G(x, yyu(k) ep, #4 Oly, z)u(j) eF%p,*G,(z, x’) 


<Geyd20 nd) ope eee KES) 
Upon averaging, using brackets for average value 
GOON. Ps ae ee eee ai) 
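$\langle \rho_k \rho_j \rangle = N\,\delta(k + j)$ . . . (20)
$\langle \rho_k \rho_j \rho_l \rho_m \rangle = N^2 \sum_{\mathrm{perm}} \delta(k + j)\,\delta(l + m) + N\,\delta(k + j + l + m)$ . . . (21)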


and so on, neglecting terms relatively of order $N^{-1}$. This can be obtained
directly of course, without using the expansion (16). This gives 


(Gv, 2") =G,(a, 0°) + | [| NOC, y) Yuk) 


x E,O(y, 2), (2, x") dy dzd®k+.... . . (22) 


This is conveniently expressed in diagrams, which are slightly different 
from those of electrodynamics. Consider G(x, x’) before averaging, draw 
a full line for every G, and a dotted line for every uρ. Then the expansion
of G(x, x’) is written 


[Diagrams: the expansion of G(x, x') drawn with full and dotted lines] . . . (23)


The averaging process joins the loose dotted line ends up, and places the
value N at a join of two, N at a join of four, and so on. Mark these joins
by a large dot. Then the average appears as

[Diagrams: the averaged expansion of $\langle G_+ \rangle$, with the dotted lines joined at large dots] . . . (24)

These diagrams can readily be labelled in configuration or momentum 
space 


[Diagrams: the same graph labelled in configuration space (x, y, z, x') and in momentum space (p, p - q, p)] . . . (25)


the dot having significance only when diagrams like the last in eqn. (24)
appear, which incidentally has no analogue in electrodynamics†, which
is effectively the case of N infinite. Now concentrate attention upon the
series


[Diagrams] . . . (26)
This is a simple expansion of the series 
; =i 
(4.0°)-N G.mgne(p—qPq) sD 
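
Summing a series of repeated insertions into a closed form is the summation of a geometric series. The sketch below illustrates this with complex scalars standing in for the momentum-space functions; `g0` plays the part of the free function and `s` the part of the single inserted loop, and both names and the numerical values are hypothetical.

```python
def dressed_partial_sum(g0: complex, s: complex, n_terms: int) -> complex:
    """Sum g0 + g0*s*g0 + g0*s*g0*s*g0 + ... truncated after n_terms terms."""
    term = g0
    total = 0j
    for _ in range(n_terms):
        total += term
        term = term * s * g0  # attach one more insertion and one more free line
    return total

def dressed_closed_form(g0: complex, s: complex) -> complex:
    """Closed form 1 / (1/g0 - s); the series converges to it when |g0 * s| < 1."""
    return 1.0 / (1.0 / g0 - s)

if __name__ == "__main__":
    g0, s = 0.5 + 0.2j, 0.3 - 0.4j  # illustrative values only
    print(dressed_partial_sum(g0, s, 60))
    print(dressed_closed_form(g0, s))
```

The closed form shows why the summed series can be read as a shift of the denominator of the free function by the inserted quantity.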


† In electrodynamic language the $G_+$ is $S_F$, the dotted line the vertex Γ,
the dot the photon Green function $D_F$.


Since Gq) = (277) 3(q? - H+ie)— the etntogre can be written 
AP | G,( Gi. (q)u*(p —q) d®q + — ' = Qn? =; | 8(q° — L)u*(p —q) dq. (28) 


If now u is taken as isotropic so that $u^2$ is $u^2(p^2 + q^2 - 2pq\cos\theta)$ this can be
written as $A + iB$, where A, B are real


A=4NP | (q°—B)'w*(p? +g —2pq 00s 6) dy eae et 20) 


B= (2r)7 nN | 3(q?—B)u*(p*-+-g°— 2pq cos 6) dg. ice Tesi) 


These can be expanded as series in $p^2 - E$ so that this approximation to
$\langle G_+ \rangle^{-1}$ is in the form

$(p^2 - E + i\epsilon + a_1 + i b_1 + (p^2 - E)(a_2 + i b_2) + \dots)$ . . . (31)
where


ay = $(27) SNP | (q° — #) tw E + q? — 2q0/(E) cos 6)3(p*— E) d?qd®p (32) 


b, = N»/E(87)— | sin @d0u2(2H(1—cos6)). . . . . . . . (33) 


u* is in fact the differential scattering probability in Born approximation, 
W (8) = (477?)?277h-1u?(2H (1 — cos 6)) 


b= (27)-84/(E)AN { sinOw,(6)d0. . . . . (84) 


A rather more refined treatment is to expand not in terms of $p^2 - E$ but
$p^2 - E + a_1 + i b_1$, an important step in electrodynamics where only $a_1 - E$ is
defined, but here it makes little odds as we are anyway taking all the a's
to be small, and the effect of the terms $a_2$, $b_2$ etc. will come out very small.
So it has been found that this series summed gives effectively a complex
displacement of the energy E

(GCE) AG dia OL) er en ere ade (oD) 
where (H+6H)¥? = (H+a,+1b,)¥? 
~ S/H + (a,4+1b,)/2VL 
= (/H' +i. et eee ee ere.) (30) 


So in configuration space 
(Goa (diherOn Coates is | (87) 
(cf. Bardeen 1956). If this calculation had been performed for G_ the 
result would have been : 
CG Age Oe ele ee vn (88) 
Now consider to what extent one can take the forms (26), (37) as 
adequate approximations to the whole series. Consider first those terms 
containing one dot only, in particular the series 


[Diagrams: the series of terms containing one dot only] . . . (39)




This series is in fact building up the exact scattering of one electron by 
one scattering centre, instead of its first Born approximation, and if 
instead of taking the unit 


in the series (26) one took all one dot diagrams one would just replace 
the Born approximation scattering by the true differential scattering 
cross section in (34), w(@). This can be important in strong interaction, 
and this way of looking at it will be valuable if one can think of the 
electron in strong interaction with the scattering centres one at a time, 
as is usually considered to be the case. In a dense system, however, the 
electron interacts with many centres at once, and one cannot disentangle 
the scattering with one centre from that of all the others, i.e. the other 
terms to be discussed below. Henceforward these diagrams will be 
ignored except inasmuch as the differential scattering cross section can be 
understood as the true one rather than Born approximation. Now turn 
to the other terms in (24), in particular, say, (δ) and (ε). The electrical
conductivity based on the approximate sum (31) and further calculations
to be given below comes out to be of order $\Gamma^{-1}$, i.e. inversely with the
square of the interaction. Terms like (δ), (ε) and the higher terms, if
included, give a series in the interaction, but do not alter the first term
which will still dominate the calculation. To see this it is perhaps 
simplest to look at G in configuration space. The series which has been 
considered so far (26) amounts to 
(4arr)—t ef V2 Pr — (darr) 1 ef VO" + Er +- 4 Prt |.) (40) 
where L=U/H'—-V/EH)-T. . . .. . . . (0 
The inclusion of terms like (8), (e) etc. adds in terms so that in first order Lr 
is corrected by a constant, in second order $L?r? is corrected by a term 
in r, and so on, always a power of r less in any order, so that summing on 
the basis of (31) one has 
(Ge) = (4a) tet VE Pepe) ene) 


and, as will appear below, this affects the conductivity by second and
higher terms in $u^{2}$:
$$\sigma \sim O(\Gamma^{-1}) + O(1) + O(\Gamma) + O(\Gamma^{2}) + \dots \tag{43}$$
Thus (37) is a good basis for the evaluation of $\langle G_{+}\rangle$ and hence, of
course, $\langle G_{-}\rangle$.


To summarize in a rather formal way the above discussion, consider 
the identity 


Gala, 0 y=G (ee [ 


+] [| | CGY) er kp. (y, 2)uli) eC, (2, 2" 
x dy O22 20") ee ee re (44) 


G O(a, y) eu (k) eG Oy, at) dey dek 


Then 
CGE, 2')) = Ox, 2’) — | GO (a, y) &*Yu(k) (p, \G,(y, a) dy dk | 


{Hy G2, y) 8M u(k) Coy, (y, 2)u(j) op, 2,2") 


NN) Paauee pee le eto WO 4B) 
= G(a, 2") + +) [| Ee wulkjoG) eh *.,0G, 2) 

x (o.pG,(z,0")) dy d2dkdj 2... . (46) 
pet i JF @ MEW. 2)G.ex)) Py dee, (47) 


which defines $\Sigma(y,z)$. Then
| GG ty = ME ees) 


The discussion above has shown that when the interaction is weak, an
evaluation of $\Sigma$ by perturbation theory is adequate. So far all the
discussion has concerned $\langle G_{+}\rangle$, but (17) involves the evaluation of
$\langle G_{n}^{+}G_{m}^{-}\,\partial f/\partial E\rangle$, and the two $G$'s and the $f$ will interfere. The dependence
of $f$ upon the $\rho$'s in the averaging does not affect the answer in its leading
term, so it is ignored for the present. Since the problem is already being
considered for an infinitely large conductor the distinction between $n$ and
$m$ can be dropped at this stage. Consider at first the quantity
$\langle G(x,x')G(y,y')\rangle$. In terms of the diagrams, this consists of two full
lines, with dotted lines leaving and entering both; in particular, in
addition to the types of series (24) one also has types


[Diagrams of types (a), (b), (c), (d), (e) and (f)]


Types (d), (f) clearly belong to the same category as those of (24, A) and
will be considered no further. Also disregard type (e), which is an interference
between types ($\alpha$) and ($\beta$), and corresponds to the Lamb shift in
electrodynamics: it can readily be found to be small when the interaction
is weak. Of the remainder, types (a), (b) are again the first two of a simple
series analogous to ($\beta$), ($\gamma$) of (24). Assuming then that terms like (a),
(b) do not interfere with terms like ($\beta$), ($\gamma$) one has the equation for $\langle GG\rangle$:


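$$\langle G(x,x')G(y,y')\rangle = \langle G(x,x')\rangle\langle G(y,y')\rangle + N\iint \langle G(x,z)\rangle\langle G(y,w)\rangle\,u^{2}(k)\,e^{ik(z-w)}\,\langle G(z,x')G(w,y')\rangle\,d^{3}k\,dz\,dw, \tag{50}$$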


or in the spirit of (48) the exact equation can be written symbolically in
terms of a generalized ‘interaction’ $I$
$$\langle GG\rangle = \langle G\rangle\langle G\rangle + \langle G\rangle\langle G\rangle\,I\,\langle GG\rangle \tag{51}$$
and as in (48), by explicit calculation of the errors, one finds that $I$ can be
evaluated by perturbation theory if the interaction is weak. The first
approximation, the sum of (a), (b) . . ., is then (51) with
$$I(\alpha,\beta) = N\int d^{3}k\,u^{2}(k)\,e^{ik(\alpha-\beta)}. \tag{52}$$


The quantity required has $x'=y$, $y'=x$, so let $A$ be $\langle GG\rangle$ in this case.
Then if $A$ is written in momentum space, the quantity required in (17) is,
where $\Omega$ is the total volume,
$$\Omega\int p\cdot q\,A(p,q)\,d^{3}p\,d^{3}q \tag{53}$$
where (52) becomes, putting
$$\langle G(p)\rangle\langle G(-p)\rangle = g(p), \tag{54}$$
$$A(p,q) = g(p)\,\delta(p-q) + N\int g(p)\,u^{2}(p-s)\,A(s,q)\,d^{3}s. \tag{55}$$
Let
$$\int p\cdot q\,A(p,q)\,d^{3}q = K(p); \tag{56}$$
then
$$K(p) = p^{2}g(p) + N\int g(p)\,u^{2}(p-s)\,p\cdot q\,A(s,q)\,d^{3}s\,d^{3}q \tag{57}$$
$$\phantom{K(p)} = p^{2}g(p) + 2\pi N\int g(p)\,u^{2}(p^{2}+s^{2}-2ps\cos\theta)\cos\theta\sin\theta\,K(s)\,s^{2}\,ds\,d\theta. \tag{58}$$


The fact that $p\cdot q = pq\cos\theta_{pq}$ effectively contributes $pq\cos\theta_{ps}$ is seen
by writing all the quantities concerned in spherical harmonics, or perhaps


simpler by considering the series of which (52) is the sum, consisting of 
terms like 


[Diagram not reproduced.]


The integration over $q$ with $\cos\theta_{pq}$ leaves a $\cos\theta_{ps}$ for the $s$ integration,
and so right through the diagram, which is in effect (58).

Now $g(p)$, apart from the displacement $\Gamma$, is just a $\delta$ function with
$p=\sqrt{E}$, or more accurately $p=\sqrt{E'}$. $K(p)$ has essentially this same
feature, so to the accuracy that has been used so far, the solution of (58)
can be written down at once


K(p)=q(p) E —27N | u?(2E(1 — cos @)) cos Asin 6 | q(s)d2 as | Os . (59) 


The integral of K can be obtained by contour integration or by the 
discussion below in equations (64)—(66). 


| K(p) @p= aan (60) 
= = (-—1”’)4 apcei as Me ROY) 
where 
I’ = (82r)-1Nii | u?(2H'(1 — cos @)) sin 8 cos 6 dé a0) 
or in the notation of (34) | 
T’ =4(20)-8Nh | cosOsin6w,(6)d0. . . . . (63) 


It is perhaps useful to look at (60) in configuration space. Without 
scattering the conductivity is 


ee (Cs & aE[v sin ey sin a2 (aflak) dr. (64) 


This diverges at large distances where the integral over $r$ looks like

$$\sim \int dr.$$

When the scattering is introduced, the term $\sin\sqrt{E}\,r$ is replaced by
$\sin\sqrt{E'}\,r\;e^{-\Gamma r}$ and also the averaging process introduces the cross
term $e^{\Gamma' r}$, giving altogether, at large distances,


[eats ar ee (65) 


Oe lee) ame (68) 


This, of course, is the usual answer (cf. Peierls 1955, eqns. (6.16),
(6.20)); Peierls' $w$ is our $Nw/\Omega$. In particular, for free electrons
$\partial f/\partial E$ is approximately a $\delta$ function at the Fermi surface, so integration
over the Fermi surface gives the final result


A —1 
= en ( damn, | w(6)(1-— cos 0) sin 48) ee Os.) 


where », is the density of electrons, and n, the density of the scatterers, 




The dependence of $f$ upon the configuration of scatterers can be
expressed by expanding it in a power series in the density of scatterers
and adding in the terms so produced. These corrections are of the same
order as those in (43), and since they are well defined at $T=0$, do not
involve the temperature in any critical form.

It is worth remarking that in averaging over all configurations, one
includes those configurations for which the conductivity is infinite, for
example an ordered lattice system. The method of calculation presented
here automatically gives these configurations negligible weight, but a
rigorous treatment would require more care.


§ 4. Discussion 

It has been shown that the exact formal solution of the equation of 
motion can be evaluated to give the usual solution of the Boltzmann 
transport equation, and within the framework of weak interaction it is 
quite easy to write down higher order corrections, though of course these 
rapidly become very numerous. Of more interest is the possibility of 
evaluating formula (11) in cases where perturbation theory is not 
applicable. An example, which is still far from being the most general 
state of affairs but is of physical interest, is the case when the distribution 
of scatterers (which may be lattice vibrations, etc.) is known through a 
partition function, and the electrons still interact weakly with the 
scattering centres. This is a model of a liquid or an alloy. For a liquid, 
eqns. (20), (21) and so on, are not satisfied, and the averages can only be 
found from the partition function. If it could be assumed to a reasonable 
degree of approximation that 


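$$\langle \rho_{k}\rho_{j}\rho_{l}\rho_{m}\cdots\rangle = \sum F(k-j)\,F(l-m)\cdots \tag{68}$$
where
$$F(k-j) = \langle \rho_{k}\rho_{j}\rangle, \tag{69}$$
then one could immediately write down the conductivity by replacing
$$\int w(\theta)(1-\cos\theta)\sin\theta\,d\theta \tag{70}$$
by
$$\int F(2E(1-\cos\theta))\,w(\theta)(1-\cos\theta)\sin\theta\,d\theta \tag{71}$$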


in (67). Since one is dealing with smooth averages this approximation
may be adequate for this problem, even if it is not so for the theory of
liquids as a whole. In general, however, the conductivity will involve not
only the two-body correlation function $F$, which is available experimentally,
but the whole partition function, which at present is not available. There
are models available for alloys, however, in particular for superlattice-forming
alloys, and this problem is being considered further. Methods
are in existence for the evaluation of formulae like (17) in cases where
perturbation methods are inapplicable, but a discussion of these will be
left until they have been successfully applied.
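
As a rough numerical illustration of replacing the angular integral (70) by (71), the sketch below evaluates both by the midpoint rule. The functions `isotropic_w` and `smooth_F` are hypothetical stand-ins chosen only for illustration, not quantities from the paper, and the argument of $F$ is assumed to be the squared momentum transfer $2E(1-\cos\theta)$, by analogy with the argument of $u^{2}$ used earlier. Only the weighting $(1-\cos\theta)\sin\theta$ is taken from the text.

```python
import math


def angular_integral(w, F=None, E=1.0, n=20000):
    """Midpoint-rule estimate of the integral over theta in [0, pi] of
    F(2E(1 - cos theta)) * w(theta) * (1 - cos theta) * sin theta.
    With F=None the factor F is taken as 1, which is the form (70)."""
    h = math.pi / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * h
        weight = (1.0 - math.cos(theta)) * math.sin(theta)
        factor = 1.0 if F is None else F(2.0 * E * (1.0 - math.cos(theta)))
        total += factor * w(theta) * weight * h
    return total


def isotropic_w(theta):
    # Hypothetical isotropic differential cross section (arbitrary units).
    return 1.0


def smooth_F(k_squared):
    # Hypothetical two-body correlation factor: suppresses small momentum transfer.
    return 1.0 - math.exp(-k_squared)


plain = angular_integral(isotropic_w)                 # form (70)
weighted = angular_integral(isotropic_w, F=smooth_F)  # form (71)
```

For an isotropic cross section the unweighted integral can be done by hand and equals 2, which gives a quick check on the quadrature. Any correlation factor that suppresses small-angle scattering, as the stand-in here does, lowers the integral and so raises the predicted conductivity.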




ACKNOWLEDGMENTS 

The author would like to thank Professor Peierls for suggesting this 
problem and for helpful discussions during its solution. He would also 
like to thank Drs. G. V. Chester and A. Thellung, who have also considered 
the evaluation of the exact formulation from a different viewpoint, for 
helpful discussions, and Professor J. M. Luttinger for pointing out some 
inadequacies of the first version of the work and for drawing the author’s 
attention to the work of Bardeen. 


REFERENCES 


BARDEEN, J., 1956, Handbuch der Physik, Berlin, 15, 274.
GREENWOOD, D. A., 1958, Proc. phys. Soc. Lond., 71, 585.
KOHN, W., and LUTTINGER, J. M., 1957, Phys. Rev., 108, 590.
KUBO, R., 1956, Canad. J. Phys., 34, 1274.
NAKANO, H., 1956, Progr. theor. Phys., 15, 77.
PEIERLS, R. E., 1955, Quantum Theory of Solids (Oxford).




The Deformation of Metals by Self-Diffusion†


By A. P. GREENOUGH 


University College, Swansea, Glam. 


[Received June 2, 1958] 


ABSTRACT 


The experimental data on the deformation of metals at temperatures near
their melting points are reassessed. It seems that at any given stress, an
initial period in which the strain rate decreases is followed by a second period
in which the strain rate is constant. The Nabarro–Herring theory satisfactorily
accounts for the constant strain rate. In the initial period, the higher
strain rate cannot entirely be attributed to offsetting and kinking at grain 
boundaries. It is suggested that edge dislocations may rotate about suitable 
nodes, and in planes normal to their slip planes, thus acting as sources or 
sinks for vacancies during the initial period. 


§ 1. INTRODUCTION 


It is about ten years since Nabarro (1948) and Herring (1950) independently
suggested that metals might be able to deform under very low
stresses by a self-diffusion mechanism. Essentially, it is postulated
that vacant lattice sites diffuse through the crystals, thus effectively 
producing mass transfer in the opposite direction to the vacancy flow. 
Both authors considered that the grain boundaries and the free surfaces 
of the metal specimens could act as sources or sinks for vacancies. 

Since these papers were published, the results of a considerable amount 
of experimental work have appeared. Unfortunately, in many cases, the 
analysis of the results of this work is incomplete or inaccurate, and there- 
fore it seems appropriate that the work in this field should be critically 
reviewed.

Herring derives expressions for the relationship between the applied 
stress (which must be corrected for surface energy effects) and the strain 
rate, for both wire and foil specimens. In the case of a wire which has a 
‘bamboo’ grain structure, i.e. the grains occupy the whole cross section 
of the wire, and the grain boundaries are perpendicular to the wire axis, 
we obtain from his eqn. (16) the expression : 


Strain rate,
$$\frac{1}{L}\frac{dL}{dt} = \frac{B D \Omega S}{L\,2r\,RT}, \tag{1}$$
where $L$ = average length of crystal grains in wire, $t$ = time, $D$ = volume
self-diffusion coefficient, $\Omega$ = gm atomic volume, $r$ = radius of wire,
$R$ = gas constant, $T$ = temperature, $B$ = numerical constant = 12 (approx.)
when $L=2r$, and $S$ = applied stress, corrected for surface energy effects.


† Communicated by the Author.




Both Herring and Nabarro give good reasons why grain boundaries 
separating grains with less than a minimum orientation difference should 
not act as sources or sinks for vacancies under these conditions. The 
strain rate : stress graph for wires of given structure should thus be linear, 
and the slope of the graph at any given temperature ($\epsilon_s$) may be calculated
if $D$ is known.

Since $D$ may be expressed as follows:
$$D = A\exp(-E/RT) \tag{2}$$
where $A$ = constant and $E$ = activation energy for self-diffusion, $E$ may
be found from the gradient of the $\log \epsilon_s T : 1/T$ graph. For typical
metals this is 5–10% higher than the activation energy for creep calculated
from the slope of the $\log \epsilon_s : 1/T$ graph.
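
The relation between the two activation energies can be made explicit with a short derivation. It is a sketch under the assumption that the slope of the strain rate : stress graph, written here as $\epsilon_s$, depends on temperature only through the factor $D/T$ implied by eqns. (1) and (2). The symbol $E_{\mathrm{creep}}$ is introduced here for the apparent energy from the uncorrected plot.

```latex
\begin{align}
  \epsilon_s &\propto \frac{D}{T} = \frac{A}{T}\exp\!\left(-\frac{E}{RT}\right), \\
  \frac{d\ln(\epsilon_s T)}{d(1/T)} &= -\frac{E}{R}, \\
  \frac{d\ln \epsilon_s}{d(1/T)} &= -\frac{E}{R} + T = -\frac{E - RT}{R}
  \quad\Longrightarrow\quad E_{\mathrm{creep}} = E - RT .
\end{align}
```

With $R = 1.987$ cal/mole/deg and $T$ near 1300°K, $RT$ is about 2600 cal/mole, so for $E$ near 50 000 cal/mole this term alone accounts for a difference of roughly 5%.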

For a given grain structure, the strain rate at any temperature is directly 
proportional to the applied stress, and the deformation resembles viscous 
flow in this respect. For this reason, some authors prefer to compare 
experiment with theory in terms of apparent viscosity. 


§ 2. REVIEW OF EXPERIMENTAL WORK
2.1. Copper 


Herring (1950) first analysed the results for copper wires obtained by 
Udin et al. (1949) and Udin (1951). Table 1 compares the gradients of the 
strain rate : stress graphs obtained experimentally with the gradients 
calculated from the most recent values of the self-diffusion coefficients of 
copper obtained by the radioactive tracer technique, using single crystal 
specimens (Kuper et al. 1954). The slopes calculated from the self-diffusion
data used by Herring are also shown.

The activation energy for self-diffusion derived from the deformation of 
the wires is 61000 cal/mole, compared with 47140 cal/mole obtained by 


Table 1

Slope of strain rate : stress graph (cm²/dyne/sec)

Temperature    Wire diam.    Observed    Calculated (Kuper et al.)    Calculated (Herring)

Dei Opel Oms 
DAS li)meee 
EGS KOE 
3-38 x 10-4 
1-39 x 10-4 
8-99 x 10-14 
2-75 x 10-4 


3°60 10-4 
2-60 x 10-4 
1-85 x 10-4 
8-98 x 10-15 
5:68 10-4 
292 < 10-4 
1-42 x 10-4 


1-23 x 10-4 


1-92 x 10-14 




the radioactive tracer technique. Contrary to theoretical predictions, 
the slopes of the strain rate : stress graphs at a given temperature are less 
for the smaller diameter wires than for the larger diameter wires. 

Pranatis and Pound (1955) studied the deformation of copper foil. Unfortunately,
they consider that “by definition, the reciprocals of the slopes
of the strain rate : stress curves are the viscosities, that is $\sigma=\eta\dot{\epsilon}$”, and thus
they omit a numerical constant in deriving the apparent viscosity from
their experimental measurements.

Frenkel (1945) showed that when a longitudinal force $F$ acts on a viscous
rod of length $L$ and volume $V$, the extension of the rod is given by
$$\frac{1}{L^{2}}\frac{dL}{dt} = \frac{F}{3\eta V}, \tag{3}$$
where $\eta$ is the viscosity, i.e.
$$\sigma = 3\eta\dot{\epsilon}, \tag{4}$$
where $\sigma$ = applied stress and $\dot{\epsilon}$ = strain rate.

As noted by Herring (1950) there is a numerical error in Frenkel’s paper 
which has here been corrected. Since the equations used in the deriv- 
ation of this formula are linear, the deformation due to a transverse 
force acting simultaneously on the rod may be considered independently. 
The foils of Pranatis and Pound may be considered to be unrestrained at 
the edges. Thus the apparent viscosities of the specimen can be derived 
from the slopes of the strain rate : stress curves, but the values given by 
Pranatis and Pound should be divided by three. 
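
The factor of three can be illustrated with a minimal sketch, assuming the relation stress = 3 × viscosity × strain rate given above. The function names and the sample slope are hypothetical and are not taken from any of the papers reviewed.

```python
def apparent_viscosity(slope):
    """Apparent viscosity (poise) from the slope of the strain rate : stress
    graph (cm^2/dyne/sec), using stress = 3 * viscosity * strain rate."""
    if slope <= 0:
        raise ValueError("slope must be positive")
    return 1.0 / (3.0 * slope)


def uncorrected_viscosity(slope):
    """Plain reciprocal of the slope, i.e. the value obtained when the
    numerical constant 3 is omitted."""
    if slope <= 0:
        raise ValueError("slope must be positive")
    return 1.0 / slope


slope = 1.0e-13  # hypothetical slope in cm^2/dyne/sec, for illustration only
eta = apparent_viscosity(slope)
eta_uncorrected = uncorrected_viscosity(slope)
```

Whatever the slope, the uncorrected value is three times the apparent viscosity, which is why published viscosities obtained as plain reciprocals need dividing by three before comparison with theory.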

It is not clear how Pranatis and Pound have derived their calculated
values of the viscosity. On p. 667 of their paper they misquote the
expression derived by Herring for the apparent viscosity of foils of this 
type, where the tangential stresses at the grain boundaries are relaxed. 


The formula should be:
$$\eta = \frac{1}{10}\left(\frac{3}{4\pi}\right)^{2/3}\left(\frac{RT}{D\Omega}\right)V_{g}^{2/3}, \tag{5}$$
where $V_g$ = average grain volume. Pranatis and Pound do not indicate
how they have calculated $V_g$.

In table 2 the slopes of the strain rate : stress graphs are calculated
assuming that
$$V_{g} = \pi r^{2} a, \tag{6}$$
where $r$ is the average radius of the grains and $a$ is the thickness of the foil.
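
The calculation behind the predicted slopes can be sketched as follows. This assumes that the foil formula has the form $\eta = \frac{1}{10}(3/4\pi)^{2/3}(RT/D\Omega)V_g^{2/3}$ with $V_g = \pi r^{2}a$, and that the slope is $1/3\eta$. The prefactor is exposed as a parameter because its exact value is an assumption here, and all numerical inputs are hypothetical rather than the data used for table 2.

```python
import math

R_GAS = 8.314e7  # gas constant in erg/(mole deg K)

# Assumed numerical prefactor of the foil viscosity formula.
DEFAULT_PREFACTOR = 0.1 * (3.0 / (4.0 * math.pi)) ** (2.0 / 3.0)


def grain_volume(r, a):
    """Volume (cm^3) of a disc-shaped grain of radius r spanning foil thickness a."""
    return math.pi * r ** 2 * a


def foil_viscosity(D, omega, T, V_g, prefactor=DEFAULT_PREFACTOR):
    """Apparent viscosity (poise) for diffusion coefficient D (cm^2/sec),
    gm atomic volume omega (cm^3), temperature T (deg K), grain volume V_g (cm^3)."""
    return prefactor * (R_GAS * T / (D * omega)) * V_g ** (2.0 / 3.0)


def predicted_slope(D, omega, T, r, a):
    """Slope of the strain rate : stress graph (cm^2/dyne/sec), taken as 1/(3*eta)."""
    eta = foil_viscosity(D, omega, T, grain_volume(r, a))
    return 1.0 / (3.0 * eta)


# Hypothetical inputs, for illustration only.
D = 3.0e-9     # cm^2/sec
omega = 7.1    # cm^3 per gm atom
T = 1323.0     # deg K
r = 0.01       # cm, assumed grain radius
slope_thin = predicted_slope(D, omega, T, r, a=0.0038)
slope_thick = predicted_slope(D, omega, T, r, a=0.0127)
```

On this model the predicted slope is proportional to $D$ and falls as the foil thickness (and hence the grain volume) increases, which is the trend shown by the calculated columns of table 2.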

Data on the self-diffusion coefficient of copper as determined by the 
radioactive tracer technique has been taken from the work of Kuper et al. 
and also from the work of Rollin (1939), so that comparison may be made 
with the calculations of Pranatis and Pound. 1051°c has been considered 
as 1050°C, 1024°c as 1025°c, and 1002°c as 1000°c. 

Pranatis and Pound give the activation energy for the self-diffusion of 
copper from their results as 56 800 cal/mole. The best result obtained by 
the radioactive tracer technique is 47 140 cal/mole.


A number of unweighted single crystal specimens 0.0015 in. thick were
heated under conditions where shrinkage would be expected. However,
no deformation could be detected.


Table 2 


Slope of strain rate : stress graph, cm²/dyne/sec x 10%

Temperature   Foil thickness   Observed       Calculated     Calculated
°C            (in.)                           Kuper et al.   Rollin

1050          0.0015           15.7 in He     50.8           22.8
                               20.0 in H₂     50.8           22.8
              0.002            10.7           30.9           13.7
              0.003            8.96           22.6           10.0
              0.004            8.96           23.3           10.3
              0.005            4.90           16.8           7.42
1025          0.0015           14.21 in He    36.7           14.8
                               14.43 in H₂    36.7           14.8
              0.002            8.26           44.7           8.89
              0.003            7.19           16.4           6.51
1000          0.0015           10.53 in He    26.1           9.44
                               10.70 in H₂    26.1           9.44
              0.002            6.21           15.9           5.67
              0.003            4.27           11.4           4.15
975           0.0015           9.01           18.4           5.94
960           0.0015           9.38           14.9           4.44


2.2. Silver 


Greenough (1952) studied the deformation of silver wires in nitrogen,
and found that reproducibility of results was poor. The average strain
rate for nine specimens of 0.01016 cm diameter wire heated at 920°c for
741 hours was $-7.61\times10^{-9}$ per sec, the results for individual specimens
ranging from $-9.97\times10^{-9}$ to $-5.02\times10^{-9}$ per sec. The predicted
strain rate was $-2.91\times10^{-9}$ per sec, on the basis of the radioactive tracer
experiments of Tomizuka and Sonder (1956) on silver single crystals. In
a second experiment, the average strain rate in eight specimens at 920°c
for 442 hours was $-10.6\times10^{-9}$ per sec, results ranging from $-13.3\times10^{-9}$
to $-8.28\times10^{-9}$ per sec. The predicted strain rate was $-4.87\times10^{-9}$ per
sec.
The deformation of two single crystal wires 0.01 cm diameter with an
applied stress of the order of $12\times10^{4}$ dynes/cm² could not be measured
after heating to 910°c for 283 hours. The strain must have been less than
$4\times10^{-3}$. Under the same conditions, the strain in a similar polycrystalline
rod having five grains across a diameter was $13\times10^{-3}$.

Funk et al. (1951) studied the deformation of 0.0129 cm diameter silver
wires in a helium atmosphere for periods up to 116 hours. From their 
results, and from the self-diffusion coefficients calculated from the results 
of Tomizuka and Sonder, the following table has been drawn up. 


Table 3 


Slope of strain rate : stress graph
cm²/dyne/sec

Temperature             Observed          Calculated
°C

876 (one result)        [illegible]       0.97 × 10⁻¹⁴
907 (three results)     [illegible]       1.57 × 10⁻¹⁴
923 (three results)     6.5 × 10⁻¹⁴       1.99 × 10⁻¹⁴
939 (three results)     14.5 × 10⁻¹⁴      2.50 × 10⁻¹⁴


The activation energy for the self-diffusion of silver derived from the 
deformation of the wires is 122 000 cal/mole (omitting the result at 876°c), 
or 108000 cal/mole taking into account all the results. 

The work of Buttner et al. (1952a) indicates that a partial pressure of
oxygen in the helium of 0.1 atmosphere makes no significant difference to
the slope of the strain rate : stress graph at about 935°c.


2.3. Gold 


Alexander et al. (1951) reported the results of experiments on gold wires, 
but the experiments were carried out under conditions where the wires 
must have been rapidly contaminated by nickel and chromium from the 
nichrome alloys present in the furnace. However, the experimental 
results are in reasonable agreement with predictions based on Herring’s 
theory. Evidence was presented to show that wires which were shrinking 
tended to neck and break up into small pieces. The process is illustrated 
for wires lying on an alundum surface. In this case, friction at the points 
where the wire touched the surface would give rise to this effect. The 
apparent bulges and necks observed in sections of suspended wires can be 
interpreted in terms of the kinks which develop in the wires. No necks 
would be expected if the wires were shrinking by the Nabarro–Herring
mechanism. 

The experimental work of Buttner et al. (1952b), in which 0.0064 cm
diameter wire was heated in a helium atmosphere, seems to be more
satisfactory. However, their interpretation of their result is unsatisfactory.
In their eqn. (6), they derive an expression for the apparent
viscosity of the wires by the method of dimensional analysis, but they
omit to insert the numerical constant required in this method. The values
which they derive for the apparent viscosity of their wires should thus be
divided by three, as reference to the original paper by Udin et al. (1949)
will show.

In calculating the apparent viscosity from Herring’s formula, they 
ignore the self-diffusion data for gold obtained by the radioactive tracer 
technique, and reported in a paper by Dienes (1950) which they quote. 
Instead, they use the activation energy from their own experimental 
results to calculate the self-diffusion coefficients for gold, on the basis of 
Dienes’ formula (1950). 

In table 4 the slopes of the strain rate : stress curves obtained by
experiment are compared with the slopes predicted by Herring’s formula,
using the data obtained by the radioactive tracer technique by Gatos
and Kurtz (1954), by Makin et al. (1957) and by Okkerse (1956). The last
author worked with single crystal specimens.


Table 4 


Slope of strain rate : stress graph
cm²/dyne/sec × 10¹⁴

Temperature             Observed    Calculated
°C                                  Gatos et al.   Makin et al.   Okkerse

1007 (three results)    4.12        2.64           3.74           3.20
1017 (one result)       4.85        3.01           4.22           3.58
1025 (three results)    5.95        3.34           4.64           3.91
1042 (three results)    7.14        4.14           5.63           4.70


The activation energy for deformation obtained by Buttner et al. is
51 000 cal/mole. On this basis the activation energy for self-diffusion
would be about 55 000 cal/mole. From radioactive tracer experiments
the activation energy for self-diffusion has been given as 45 300 cal/mole
(Gatos and Kurtz), 41 700 cal/mole (Makin et al.) and 39 360 cal/mole
(Okkerse), the two latter values probably being the more reliable.
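
The way such activation energies are extracted can be sketched with an unweighted least-squares fit. The example uses the four observed slopes of table 4 as read above. It is only an illustration of the two kinds of plot (log slope against 1/T, and log of slope × T against 1/T) and does not reproduce the fitting procedure of Buttner et al., which is not described here. The helper `fit_gradient` and the choice of 273.15 for the conversion to absolute temperature are assumptions of this sketch.

```python
import math

R_CAL = 1.987  # gas constant in cal/(mole deg K)


def fit_gradient(xs, ys):
    """Gradient of the unweighted least-squares straight line through (xs, ys)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


temps_c = [1007.0, 1017.0, 1025.0, 1042.0]
observed = [4.12, 4.85, 5.95, 7.14]  # observed slopes; the common power of ten drops out

temps_k = [t + 273.15 for t in temps_c]
inv_t = [1.0 / t for t in temps_k]

# Activation energy for deformation: gradient of ln(slope) against 1/T.
e_creep = -R_CAL * fit_gradient(inv_t, [math.log(s) for s in observed])

# Activation energy for self-diffusion: gradient of ln(slope * T) against 1/T.
e_diffusion = -R_CAL * fit_gradient(
    inv_t, [math.log(s * t) for s, t in zip(observed, temps_k)]
)
```

With these four points the two fits come out at roughly 53 000 and 56 000 cal/mole, differing by about RT at the mean temperature. Both are of the same order as the values quoted in the text and well above the tracer values.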


2.4. Nickel 


Hayward and Greenough (to be published) studied the deformation of
0.01218 cm diameter nickel wire in argon. During the first 30 hours or
so, the strain rate in nearly every specimen decreased. Thereafter the
rate remained constant to the end of the experiment. The constant rate
was generally about eight times faster than the rate calculated from radioactive
tracer self-diffusion data, and the self-diffusion activation energy
calculated from deformation was 53 000 cal/mole, compared with 66 800
cal/mole from radioactive tracer work (Hoffman et al. 1956). A “preliminary
rough study” reported by Burgess and Smoluchowski (1955)
gives an activation energy in the range 61 000–65 000 cal/mole.




After long periods of deformation, the appearance of the grains suggested 
that the Nabarro—Herring mechanism was responsible for deformation. 


2.5. Aluminium 


The deformation of 2.54 mm thick aluminium sheet at temperatures in
the range 577–647°c has been studied by Harper and Dorn (1957). Unfortunately
the experiments were carried out in air. It is probable that
the oxide film which grows on the metal surface under these conditions
profoundly modifies the processes of generation and absorption of vacant
lattice sites on the surface of the metal. It is thus very likely that the
deformation process studied here is not strictly comparable with the process
considered by Herring, and the process in the other metals here considered.
However, it was found that the strain rate decreased with time and then
became constant, the constant rate being proportional to the applied
stress in the range 3–18 lb/in² ($2\times10^{5}$–$13\times10^{5}$ dynes/cm²) approximately.

An interesting feature of this work was the scribing of reference marks 
on some specimens by a diffraction grating ruling machine. This enabled 
a detailed study of the deformation to be made. It was shown that the 
creep curve for specimens so marked agreed very well with that for an unmarked
specimen tested under otherwise identical conditions. Grain
boundary shearing was shown to account for about 12% of the total
deformation. Further, it was argued that if the Nabarro–Herring
mechanism was responsible for the deformation, the strain between 
markers within a grain should be zero, and all the extension of the grains 
should be concentrated at the transverse grain boundaries. 

A specimen was ruled with a series of grid lines 0.003937 in. apart, and
a stress of 10 lb/in² applied at 647°c until the strain was 0.0170. The
average grain diameter was 3.3 mm, so there must have been about 30–40
reference lines on each grain. After deformation, the average strain
across transverse grain boundaries was about 0.01695. The implication
here is that the distance between grid lines could be measured with an 
accuracy of at least +10-*in. It is difficult to see how this could be 
achieved, and the authors give no account of the method of measurement 
which they used for this purpose. The average strain within the grains was
0.01682.

In a further experiment, the deformation of a single crystal of aluminium 
at 647°C under an applied stress of 9 lb/in² was studied. It appears that
the single crystal deformed in much the same way as a polycrystalline 
specimen under identical conditions, but owing to some mistake, details 
of comparison have been omitted from the paper. 

Comparison of the observed creep rate with that predicted from self- 
diffusion data is difficult, as no radioactive tracer experiments can be 
carried out with aluminium. The best estimate for the self-diffusion 
coefficient of aluminium predicts a slope for the strain rate : stress graph
which is about $7\times10^{-4}$ times the observed slope.




The activation energy for deformation is about 35200 cal/mole. The 
results give an activation energy for self-diffusion of 36 900 cal/mole. 


2.6. Tin 


Chalmers (1936, 1937) worked in air with cylindrical specimens, 7 cm
long, and not less than 3 mm diameter. A central 3 cm length in each
specimen was used as the gauge length. Most specimens were single
crystals over the gauge length. In other specimens there were a few
crystals with longitudinal grain boundaries along the gauge length.
Stresses up to $2\times10^{7}$ dynes/cm² were applied at 21.3°c, and changes in
length of the gauge section were measured. At stresses up to $10^{5}$ dynes/cm²
the strain reached a limiting value of the order of $10^{-5}$. The initial
rate of creep was proportional to the applied stress, and of the order of
5 x 10~* per sec for a stress of 10°dynes/em?. Beyond the limiting strain 
avery slow deformation rate, of the order of 2 x 10—-" per sec, was observed. 

In the specimen with the longitudinal boundaries, no plastic deformation
could be detected for stresses below a critical stress dependent on the
specimen, but of the order of $10^{7}$ dynes/cm².


§ 3. DISCUSSION


In cases where the deformation of the specimens has been observed in 
detail for periods longer than about 50 hours, it has frequently been found 
that the rate of deformation decreases during the first 30 hours or so, but 
thereafter remains constant. This has been observed for silver and nickel 
wires, and aluminium sheets, but the rate of deformation of copper foils 
appears to be constant. On the other hand, it is significant that the 
strains in the smaller diameter copper wires of Udin et al. were measured 
over longer times than for the corresponding larger diameter wires. The 
strain rates observed in short-time experiments in copper and silver were 
appreciably higher than those predicted by self-diffusion data, though 
this is not true for gold. 

For copper, gold and nickel there is satisfactory agreement between the 
values of the activation energy for self-diffusion derived from the de- 
formation results and from radioactive tracer work. There are indications 
that short-time tests give rather higher values for the activation energy, 
and this may be the source of the discrepancy for silver. 

In some nickel specimens at least, the initially higher strain rate 
cannot entirely be ascribed to offsetting and kinking at grain boundaries 
(Herring 1950, Greenough 1952). Extrapolating the linear portion of
the strain ($\epsilon$)–time ($t$) curves to $t=0$ gives a value for $\epsilon$ of about $2\times10^{-3}$,
of which $10^{-3}$ at the most could be attributed to offsetting and kinking.
Another deformation mechanism must operate to give a strain of about
$10^{-3}$ during this initial period.

It is likely that when the stress applied to a specimen is changed, simple
edge dislocations act for a time as sources or sinks for vacancies. A simple
estimate shows that this process alone cannot give rise to strains of the
required order of magnitude.

When a tensile stress is applied to a crystal, those edge dislocations with
their half-planes of atoms perpendicular to the stress can climb by the 
addition of extra atoms to the half-plane. The dislocation may then be 
said to act as a source of vacancies, and the crystal extends by the length 
of the Burgers vector, multiplied by the ratio of the increased area of the 
half-plane to the area of cross section of the crystal. Edge dislocations 
on other slip planes and the external surface of the crystal act as sources 
for the atoms, i.e. sinks for the vacancies. If the climbing edge dislo- 
cation is pinned at two nodes, then the dislocation bows out and increases 
in length (Lomer 1957). Dislocations acting as vacancy sinks must also 
become curved, but this will be ignored in the following analysis. 

The equilibrium radius of curvature of the dislocation line may be 
determined in the same way as the curvature of a dislocation line in the 
slip plane when a shear stress is applied. If b is the Burgers vector, W 
the energy per unit length and r the radius of curvature of the dislocation, 
then the applied tensile stress is given by 

$$\sigma = \frac{W}{br}. \tag{7}$$
Putting $\sigma=5\times10^{5}$ dynes/cm², $b=2\times10^{-8}$ cm, $W=5\times10^{-4}$ erg/cm, gives
$r=5\times10^{-2}$ cm.

If the average length of edge dislocation between nodes is $10^{-4}$ cm and
the dislocation density $10^{8}$ lines/cm², the area of each dislocation ‘bow’ is
about $25\times10^{-13}$ cm² and the strain produced in the direction of the tensile
stress is at the most $10^{-8}$. In order for a dislocation to act as a ‘Frank–Read’
source or sink for vacancies at this stress (Lomer), the distance
between nodes would have to be of the order of $10^{-1}$ cm, which is greater
than the thickness of most of the specimens.
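
The order-of-magnitude estimate above can be retraced with a short sketch. It takes the stress relation as $\sigma = W/br$ and the numerical values as $\sigma = 5\times10^{5}$ dynes/cm², $b = 2\times10^{-8}$ cm and $W = 5\times10^{-4}$ erg/cm, which is how the text is read here. The small-segment area formula, the assumption that every dislocation segment is favourably oriented, and the criterion that a Frank–Read source needs a node spacing of twice the equilibrium radius are assumptions of this sketch, not statements from the paper.

```python
def bow_radius(W, b, sigma):
    """Equilibrium radius of curvature (cm) from sigma = W / (b * r)."""
    return W / (b * sigma)


def bow_area(length, radius):
    """Area (cm^2) between a shallow circular arc and its chord.
    Small-segment approximation length**3 / (12 * radius), valid for length << radius."""
    return length ** 3 / (12.0 * radius)


def climb_strain(density, length, area, b):
    """Upper-bound strain if every segment bows out by `area`:
    (segments per cm^3) * area * Burgers vector."""
    segments_per_cm3 = density / length
    return segments_per_cm3 * area * b


sigma = 5.0e5          # dynes/cm^2
b = 2.0e-8             # cm
W = 5.0e-4             # erg/cm
node_spacing = 1.0e-4  # cm
density = 1.0e8        # lines/cm^2

radius = bow_radius(W, b, sigma)
area = bow_area(node_spacing, radius)
strain = climb_strain(density, node_spacing, area, b)
frank_read_spacing = 2.0 * radius  # assumed criterion: spacing equal to twice the radius
```

This simple geometry gives a bow area and a strain of the same order as the figures quoted in the text, with the remaining factor presumably reflecting the geometry assumed there. The required node spacing of about $10^{-1}$ cm follows directly from the radius, and the strain falls short of the observed $10^{-3}$ by several orders of magnitude.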

If this mechanism operates at all, the sources must be lengths of edge 
dislocation rotating about a single node and acting as a ‘vacancy mill’. 
This process would operate most easily if the free end of the dislocation 
terminated on a free surface of the crystal at every stage in the rotation. 
A grain boundary could easily prevent the operation of this mechanism, 
the adjacent grain being unable to accommodate the deformation in the 
plane of the rotating dislocation. The total deformation would be inde- 
pendent of the magnitude of the applied stress if the rotating dislocation 
were stopped by structural barriers in the crystal, but the direction of 
deformation would depend on the direction of the effective applied stress. 

On this basis, the higher initial strain rate should be most marked in fine 
wire specimens with the ‘bamboo’ structure, and least in foil specimens. 
The observations of Chalmers on tin can also be explained on this basis. 

The work of Barnes et al. (1958) on the ‘precipitation’ of a super- 
saturated solution of helium in copper pointed to similar conclusions with 
regard to vacancy sources in the metal. 




ACKNOWLEDGMENT 


I am grateful to Dr. E. R. Hayward for discussions on this topic, and for
details of his experimental work on nickel. 


REFERENCES 


ALEXANDER, B. H., DAWSON, M. H., and KLING, H. P., 1951, J. appl. Phys., 22,
439.

BARNES, R. S., REDDING, G. B., and COTTRELL, A. H., 1958, Phil. Mag., 3, 97.

BURGESS, H., and SMOLUCHOWSKI, R., 1955, J. appl. Phys., 26, 491.

BUTTNER, F. H., FUNK, H., and UDIN, H., 1952 a, J. phys. Chem., 56, 657; 1952 b,
Trans. Amer. Inst. min. (metall.) Engrs, 194, 401.

CHALMERS, B., 1936, Proc. roy. Soc. A, 156, 427; 1937, J. Inst. Metals, 61, 103.

DIENES, G. J., 1950, J. appl. Phys., 21, 1071.

FRENKEL, J., 1945, J. Phys., Moscow, 9, 385.

FUNK, E. R., UDIN, H., and WULFF, J., 1951, Trans. Amer. Inst. min. (metall.)
Engrs, 191, 1206.

GATOS, H. C., and KURTZ, A. D., 1954, Trans. Amer. Inst. min. (metall.) Engrs,
200, 616.

GREENOUGH, A. P., 1952, Phil. Mag., 43, 1075.

HARPER, J., and DORN, J. E., 1957, Acta Met., 5, 654.

HERRING, C., 1950, J. appl. Phys., 21, 459.

HOFFMANN, R. E., PIKUS, E. W., and WARD, R. A., 1956, Trans. Amer. Inst.
min. (metall.) Engrs, 206, 483.

KUPER, A., LETAW, H. JR., SLIFKIN, L., SONDER, E., and TOMIZUKA, C. T., 1954,
Phys. Rev., 96, 1224.

LOMER, W. M., 1957, Report of a Conference on Vacancies and Other Point
Defects in Metals and Alloys (London: Inst. Metals), p. 94.

MAKIN, S. M., ROWE, A. H., and LE CLAIRE, A. D., 1957, Proc. phys. Soc. Lond. B,
70, 545.

NABARRO, F. R. N., 1948, Report of a Conference on the Strength of Solids (London:
Physical Society), p. 75.

OKKERSE, B., 1956, Phys. Rev., 103, 1246.

PRANATIS, A. L., and POUND, G. M., 1955, Trans. Amer. Inst. min. (metall.)
Engrs, 203, 664.

ROLLIN, B. V., 1939, Phys. Rev., 55, 231.

TOMIZUKA, C. T., and SONDER, E., 1956, Phys. Rev., 103, 1182.

UDIN, H., SHALER, A. J., and WULFF, J., 1949, Trans. Amer. Inst. min. (metall.)
Engrs, 185, 186.

UDIN, H., 1951, Trans. Amer. Inst. min. (metall.) Engrs, 191, 63.




A New Technique for Decoration of Cleavage and Slip Steps on 
Ionic Crystal Surfaces†


By G. A. Bassett 
Tube Investments Research Laboratories, Hinxton Hall, Cambridge 


[Received June 10, 1958] 
§ 1. INTRODUCTION


THE markings and step structures observed on cleavage surfaces of crystals
have long been used for a study of crystal imperfections (for example,
Zapffe and Worden 1949, Forty 1957). Most observations have been
made by optical microscopy or by some form of interferometry. Although
the latter method is able to measure accurately the height of steps down
to 5 Å in favourable circumstances, steps which are close together on the
crystal surface cannot be distinguished (Amelinckx 1951). Surface
replicas for study by electron microscopy also do not have sufficient
resolution in a vertical direction to detect the smallest steps on crystal surfaces.

In the course of an investigation of the nucleation and growth of epitaxial 
films a new technique has been discovered which is able to display unit 
lattice steps on cleavage surfaces of sodium chloride and has the lateral 
resolution of the electron microscope. 


§ 2. EXPERIMENTAL TECHNIQUE


A small quantity of gold which would give a film of mean thickness 10 Å
or less if spread uniformly over the surface is deposited on the crystal 
surface by vacuum evaporation from a ‘ V’-shaped tungsten filament. A 
film of carbon is then deposited (Bradley 1954) on top of the gold layer, 
the vacuum evaporator being arranged so that the two evaporations can 
be made without opening the system. The carbon film is removed from 
the substrate carrying with it the gold deposit from the crystal surface, 
by lowering gently into water. At the thickness of gold deposit used, a 
continuous film is not formed but large numbers of small nuclei form on 
the crystal surface. Nuclei form in greater numbers along the edges of 
steps on the crystal surface which are thus made visible in the electron 
microscope as chains of gold particles. On the parts of the crystal surface 
that are free of steps the nuclei are randomly dispersed. 


§ 3. OBSERVATIONS 
3.1. Sodium Chloride


The pattern of steps revealed by nucleation of the gold on a cleavage 
surface of rocksalt may vary considerably over even the relatively small
field of view available in the electron microscope (0.5 × 0.5 mm). A
typical cleavage step, decorated with a 5 Å film of gold, is shown in
fig. 1, Pl. 63. The insert in fig. 1 shows part of a cleavage step at a higher
magnification. The ‘staircase’ of which the large step is made up can be
readily seen. There are also in the field of view a number of single chains
of nuclei; it is considered that these mark single steps on the crystal surface,
although not necessarily steps of unit height. Many such single steps are
observed following a wandering path eventually to join up with a larger
cleavage step on another part of the surface.

† Communicated by Dr. J. W. Menter. An account of this work was
presented at the Conference on the Strength of Whiskers and Thin Films held at
Hinxton Hall, and the Cavendish Laboratory, Cambridge, March 17th-19th, 1958.

It is possible to obtain a direct measure of the height of a cleavage step
on the rocksalt surface where the step is crossed by a slip step. In rocksalt
slip is generally of the (110) [1̄10] type. The (110) glide plane makes
an angle of 45° with the (100) cleavage plane and intersects it along a
[001] direction. Slip on the glide plane causes a slip step, the surface
trace of which is deflected as it crosses a cleavage step by an amount equal
to the height of the cleavage step. Effectively, one may regard the slip
step as a marker on the cleavage plane. In fig. 2, Pl. 64, AB can be identified
as a slip step since (a) the step terminates on the crystal surface at A,
where there must be an emergent dislocation having a component of its
Burgers vector perpendicular to the cleavage surface, and (b) the step is
straight for a long distance. As this slip step crosses the cleavage step it
is deflected by 150 ± 8 Å. Although the resolution of the decorating
particles is not quite adequate for the number of small steps to be determined
exactly, it is possible to count 45 ± 2 steps. Now if each of these smaller
steps is a monatomic step the height of the cleavage step is 125 ± 6 Å (a half
unit cell for rocksalt is 2.8 Å). Thus, it may be concluded that most of the
individual steps of the ‘staircase’ making up the large cleavage step are
monatomic steps and from this it is considered that steps of atomic height
on the crystal surface are decorated by preferential nucleation of the gold
particles.
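The arithmetic behind this comparison can be sketched as follows. This is only an illustrative check, not part of the original measurement: the function names are hypothetical, the 2.8 Å monatomic step and the 45° glide-plane inclination are taken from the text, and errors are propagated in the simplest linear way.

```python
import math

HALF_CELL_ANGSTROM = 2.8  # half unit cell of rocksalt, as quoted in the text


def staircase_height(n_steps, n_error, step_height=HALF_CELL_ANGSTROM):
    """Height (and error) of a staircase of n_steps monatomic steps, in angstroms."""
    return n_steps * step_height, n_error * step_height


def trace_deflection(step_height, glide_angle_deg=45.0):
    """Lateral shift of a slip trace crossing a step of the given height.

    A glide plane inclined at glide_angle_deg to the cleavage surface shifts
    sideways by h / tan(angle) when the surface level changes by h.
    """
    return step_height / math.tan(math.radians(glide_angle_deg))


def implied_step_count(deflection, glide_angle_deg=45.0,
                       step_height=HALF_CELL_ANGSTROM):
    """Number of monatomic steps that would account for a measured deflection."""
    height = deflection * math.tan(math.radians(glide_angle_deg))
    return height / step_height


if __name__ == "__main__":
    height, error = staircase_height(45, 2)
    print(f"counted steps give {height:.0f} +/- {error:.0f} angstroms")
    print(f"150 angstrom deflection implies {implied_step_count(150.0):.1f} steps")
```

For a 45° glide plane the deflection equals the step height, so the counted staircase (about 126 Å) and the measured deflection (150 ± 8 Å) can be compared directly; the residual difference is consistent with the statement that most, rather than all, of the steps are monatomic.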

Figure 3, Pl. 64 shows another slip step turning abruptly through 90° 
and is considered to be an example of cross slip. If this is the correct 
interpretation of the picture it provides strong evidence for slip on planes 
different from the usual ones. If slip has occurred on a (110) [1̄10] system
then (001) is the cross slip plane. A number of similar examples have 
been observed. 

Further complex arrays of steps are shown in fig. 4, Pl. 65, which is near 
a light scratch made on the crystal surface after cleavage. It is clear that 
if the technique is to be used for a study of plastic deformation more
refined methods of deformation must be used.

Rocksalt cleavage surfaces have been examined after deformation by 
bending at room temperature in air. Decoration replicas from such 
specimens showed no significant differences from surfaces as cleaved. Slip 
lines are relatively rare on both types of surface. This observation is 
consistent with the work of Pratt (1953) who has found that when rocksalt 
is deformed at room temperature in air, slip appears to be confined below
the surface layers of the crystal and produces a surface rumpling rather
than sharp slip steps.

3.2. Other Materials


A number of other surfaces have been examined to indicate the range of
applicability of this type of decoration technique. Surfaces tried include
mica cleavage, lithium fluoride as cleaved and after etching with
C.P.4, sodium and potassium chloride whisker surfaces grown from
solution and zinc cleavage surface. Although nucleation of the gold
deposit has always occurred and it has been possible to remove the gold
nuclei adhering to a carbon film, step decoration has been obtained only
on lithium fluoride cleavage surfaces. The cleavage of lithium fluoride is
smoother than that of rocksalt and a decoration replica of a freshly cleaved
lithium fluoride surface shows fewer steps than a rocksalt surface. However,
after deformation by bending, large numbers of slip lines can be
decorated on the tension face (001) of the crystal. These slip lines are all
parallel, as would be expected from the two sets of (110) slip planes
operating. The side face (100) of the crystal shows a ‘tartan pattern’ of
slip steps intersecting at right angles. It is a characteristic of this pattern
that the slip steps on one system are sensibly straight whilst the steps on
the other system are wavy.


§ 4. DISCUSSION


The mechanism by which the decoration phenomenon occurs is not 
entirely clear. Gold atoms arriving during vacuum evaporation at the 
crystal surface have a certain mobility on the surface after arrival. 
Nucleation apparently occurs more readily along the edges of steps and 
gives rise to a smaller spacing between nuclei. The size of gold nuclei along
step edges is, as might be expected, less than for nuclei formed on flat parts
of the crystal substrate. For a gold film with a mean thickness of 3 Å the
size of the nuclei along step edges is from 25 Å down to 10 Å; on the flat
areas of the crystal nuclei range down in size from about 60 Å. It can
be seen that nuclei formed along the terminal step of a cleavage ‘staircase’,
i.e. the step adjoining a flat region of crystal, are intermediate in
size between nuclei formed on the flat crystal and in the centre of the
cleavage step. This is because the mobility of the gold on the crystal
surface permits this terminal step to draw upon mobile material on the
flat surface on one side of the step. Some measure of the distance gold
atoms can migrate on the crystal surface can be gained from a consideration
of the wandering steps on the crystal surface. When two such steps are
at a greater distance apart than some critical distance gold nuclei form
with a density and size corresponding to the flat regions of the crystal. As
the steps get closer together than about 150-200 Å all the arriving gold is
able to migrate to nucleating sites along the step edges.

Further work is required to elucidate the manner in which the tempera- 
ture of the substrate and the rate of deposition influence the nucleation of 
the deposit layer. 




ACKNOWLEDGMENTS 


I wish to thank my colleagues, Dr. J. W. Menter, Dr. A. J. Forty and 
Dr. B. H. Kear for helpful discussions in the course of this work. This 
work is published by permission of the Chairman of Tube Investments 
Limited. 


REFERENCES 


AMELINCKX, S., 1951, Phil. Mag., 42, 324.

BRADLEY, D. E., 1954, Brit. J. appl. Phys., 5, 65.

FORTY, A. J., 1957, Proc. roy. Soc. A, 242, 392.

PRATT, P. L., 1953, Discussion Properties of Metallic Surfaces (London:
Institute of Metals, 1954), p. 346.

ZAPFFE, C. A., and WORDEN, C. O., 1949, Acta cryst., 2, 377.




The Knight Shift in Superconductors 


By V. HEINE and A. B. PIPPARD†
Royal Society Mond Laboratory, Cambridge 


[Received July 15, 1958] 


ABSTRACT 


The Bardeen, Cooper and Schrieffer (BCS) theory of superconductivity
predicts a zero Knight shift at 0°K. An analysis of this result shows how
it is related to the exact pairing between electrons of opposite k and opposite
spin in this theory. The analysis and other considerations suggest a
modification of the BCS theory, which then gives a Knight shift in reasonable
agreement with Reif’s measurements.


ALTHOUGH the experimental evidence concerning the Knight shift in
superconducting mercury is not entirely concordant (Reif 1957, Knight
et al. 1956), the more detailed study of Reif indicates that it is unlikely to
vanish at any temperature and that therefore the superconductor at
0°K still retains some of the spin paramagnetism of the normal metal. At
first sight this appears to conflict with the theory of superconductivity
recently developed by BCS (Bardeen et al. 1957 a, b) which predicts
zero spin paramagnetism at 0°K (see for instance Yosida 1958). However
we wish to show how an analysis of this result suggests that the conflict
arises more from the detailed form taken by the BCS theory at present
than from any essential weakness. Furthermore, this analysis together
with some plausible physical reasoning suggests in an heuristic manner a
simple generalization of the BCS theory. On the basis of this generalization
we have made a rough calculation of the spin paramagnetism of
the superconductor at 0°K and obtained three-quarters of the value for
the normal metal, in reasonable agreement with Reif’s result.

Let us begin by considering a normal metal containing N conduction
electrons in a magnetic field H, and let ½N + n of the electron spins be
directed parallel to the field, so that the moment is 2nβ, β being the Bohr
magneton. Then the energy of the assembly is

$$E(n) - E(0) = n^2/N(0) - 2n\beta H, \qquad (1)$$

where N(0) is the density of states for one spin at the Fermi level; the
first term is kinetic energy, the second magnetic potential energy. A
minimum of E(n) occurs when n = N(0)βH, and this gives the usual
expression for the Pauli paramagnetism. Now in a superconductor the
electrons are in interaction with one another and one must add to (1)
a term representing the change in interaction energy consequent upon a
reversal of n spins. In the BCS theory there is an exact pairing of electrons
with opposite wave-number and spin, i.e. between the Bloch states ψ_{k↑} and
ψ_{−k↓}, and in the ground state the occupation probabilities g_{k↑} and g_{−k↓}
are equal, every electron being paired with its opposite number. To
turn a spin over involves breaking one pair and this raises the energy
of the whole system by a finite amount 2ε₀ (approximately equal to
3.5 kT_c, where T_c is the transition temperature of the superconductor).
Thus in the strict BCS theory one has to add to (1) an interaction term of
the form 2ε₀|n|. It is now clear that unless H > ε₀/β (about 100 kG
for mercury), E(n) has a minimum when n = 0 and the spin paramagnetism
vanishes, the spins being locked by the interaction into an
antiferromagnetic-like ground-state.

† Communicated by the Authors.
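The argument above amounts to minimizing E(n) with and without the pair-breaking term 2ε₀|n|. A minimal sketch of that minimization follows; the function names are hypothetical, the field is taken as non-negative, and the transition temperature used for mercury (about 4.15 K) is an assumed input that is not quoted in the text.

```python
BOLTZMANN = 1.380649e-23          # J/K
BOHR_MAGNETON = 9.2740100783e-24  # J/T


def energy(n, density_of_states, beta_h, eps0=0.0):
    """E(n) - E(0) from eqn. (1), plus the strict-BCS term 2*eps0*|n|."""
    return n * n / density_of_states - 2.0 * n * beta_h + 2.0 * eps0 * abs(n)


def equilibrium_n(density_of_states, beta_h, eps0=0.0):
    """Value of n minimizing energy() for beta_h >= 0.

    For n > 0 the derivative vanishes at n = N(0) * (beta_h - eps0);
    if beta_h <= eps0 the minimum sits at n = 0.
    """
    return density_of_states * max(beta_h - eps0, 0.0)


def threshold_field_kilogauss(t_c_kelvin, gap_ratio=3.5):
    """Field H = eps0 / beta at which spins begin to turn, with 2*eps0 = gap_ratio * k * T_c."""
    eps0 = 0.5 * gap_ratio * BOLTZMANN * t_c_kelvin
    return eps0 / BOHR_MAGNETON * 10.0  # tesla -> kilogauss


if __name__ == "__main__":
    print(equilibrium_n(100.0, 0.01))            # normal metal: n = N(0) * beta * H
    print(equilibrium_n(100.0, 0.01, eps0=0.1))  # strict BCS, beta*H < eps0: n = 0
    print(threshold_field_kilogauss(4.15))       # assumed T_c for mercury
```

With the assumed T_c the threshold comes out near 10⁵ gauss, in line with the figure of about 100 kG quoted for mercury.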

An escape from this situation is provided by any mechanism which
replaces the interaction term 2ε₀|n| by one which is quadratic in n, and
this obviously means that there must be no energy gap of the order of
kT_c separating the spin states n = 0 and n = 1. This can be achieved by a
relaxation of the condition of exact pairing. In qualitative terms this
means that when a pair of electrons in the ground state is broken in order to
reverse the spin of one, it is not necessary to lose all the interaction energy
contributed by this pair; even though they have now no partners of
precisely opposite k and opposite spin, they may benefit nearly as much
from interactions with electrons of opposite spin and nearly opposite k.
This does not appear to be physically unreasonable or in conflict with other
aspects of the BCS theory; in particular it is not inconsistent with there
being an energy gap of 2ε₀ for the excitation of an electron across the gap
into a ‘normal’ state (‘normal’ as opposed to ‘superconducting’ in the
sense of the two-fluid model). We conclude therefore that in order to
account for a Knight shift at 0°K, it is necessary to introduce into the
theory some correlation between k'↑ and −k↓ for unequal k and k',
and to count such a correlation as forming at least partially a valid pair.

We can sharpen this conclusion by a more mathematical discussion.
Consider the probability P(k'↑, −k↓) that the states k'↑ and −k↓
are simultaneously occupied. In the BCS theory in the general case
when n may be non-zero, the probability takes the form

$$P(k'\uparrow, -k\downarrow) = (\Psi_{BCS} | c^*_{k'\uparrow} c_{k'\uparrow} c^*_{-k\downarrow} c_{-k\downarrow} | \Psi_{BCS})$$
$$= g_{k'\uparrow}\, g_{-k\downarrow} + g_{-k\downarrow}(1 - g_{k'\uparrow})\,\delta_{kk'}. \qquad (2)$$

Thus when k and k' are not equal the probability is g_{k'↑} g_{−k↓}, implying
no correlation, while when k = k' it is g_{−k↓}, implying perfect correlation.
Analysing the latter case further, we write

$$P(k\uparrow, -k\downarrow) = (g_{-k\downarrow})(1)$$
= (probability of ψ_{−k↓} being occupied, i.e. g_{−k↓})
× (probability that if ψ_{−k↓} is occupied, then ψ_{k↑} is
also, i.e. unity). (3)

Now in the BCS theory, as we saw above, the zero Knight shift is a
consequence of the term 2ε₀|n| in the energy E(n) − E(0), due to the breaking
of pairs when turning the spins over. In view of the observed Knight
shift, we can say therefore that for n ≠ 0 the lowest energy wave function
Ψ_BCS in the strict BCS theory contains too few pairs. If we ask how the
number of pairs in the wave function can be increased, we see immediately
that in (3) the second factor is a probability and cannot be increased beyond
its present maximum value of unity. There is no chance of introducing
greater correlation between k↑ and −k↓ purely by spreading the total
spin uniformly over all pairs as in a spin wave, or by any such device. A
somewhat similar conclusion has been reached by Yosida (1958). However
the amount of correlation (2) can be increased between −k↓ and k'↑ (k' ≠ k).
We conclude that the explanation of a non-zero Knight shift demands a
theory involving correlations between the states ψ_{k'↑} and ψ_{−k↓} for k ≠ k'.

We now propose a possible modification of the BCS theory which is
based on a more general type of correlation between k'↑ and −k↓ as
suggested by the above reasoning, and which gives a finite Knight shift.
In (2), we leave alone the first term which is just the statistical uncorrelated
probability, and we change the second term. We propose to smooth out
the interaction by replacing the δ_{kk'} by a function F(q) of q = k − k' such
that Σ_q F(q) = 1. This introduces the required correlation between −k↓
and k'↑, and the success of the original BCS theory suggests that F(q)
should be sharply peaked about q = 0. In fact Cantor and Martin (1958)
have shown theoretically that F(q) should have a spread in q of the order
of 1/ξ₀, which is what one would expect physically from the interpretation
of ξ₀ as the coherence range. In any case it is difficult to believe that nature
prefers a delta function to some such smooth function. It remains to
choose the coefficient of F(q). Since the correlation is now spread over
a range of energy and of q which may be taken to be large compared with
βH, the simple probability considerations of (3) do not limit us any more.
We propose that in a region of size ξ₀ the electron-phonon interaction
tries to correlate everything with everything else it can, and therefore
that the correlation function is symmetrical between k'↑ and −k↓.
This symmetry assumption is sufficient to ensure a Knight shift. It
means that the number of pairs is determined not as in (2) by the expression
g_{−k↓}(1 − g_{k↑}), which differs by a term linear in n from its value in the ground
state, but is determined by some average between g_{−k↓}(1 − g_{k↑}) and
g_{k'↑}(1 − g_{−k'↓}). Any average will differ from the ground state (n = 0)
value of g_k(1 − g_k) only by second order quantities in n, and thus lead to
an n² term in (1) and a non-zero Knight shift. The numerical value
of the shift will depend on which particular average is used, and we
arbitrarily select the geometric mean. This has the Hartree-like simplicity
of being a product of two independent weighting factors, one from k and
one from k'. We propose therefore

$$P(k'\uparrow, -k\downarrow) = g_{k'\uparrow}\, g_{-k\downarrow} + [g_{-k\downarrow}(1 - g_{k\uparrow})\, g_{k'\uparrow}(1 - g_{-k'\downarrow})]^{1/2} F(k - k') \qquad (4)$$

as a plausible generalization of (2). This expression may be regarded
as the matrix element of c*_{k'↑} c_{k'↑} c*_{−k↓} c_{−k↓} when the wave function Ψ_BCS
in (2) is replaced by a supposed ‘better’ function Ψ, and by analogy we
write down an expression for the slightly more general matrix element
(k ≠ k')
Bal Pet tera Caeig en | ED 
= [9 (1 EO Ee ag a =U ee (J meat) 
X Jen (1 — gy 4) PAF (q). . . . . . . . . . (5) 


It is this matrix element which determines the interaction energy.
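The claim that a symmetric average removes the term linear in n can be checked numerically for the case k = k'. The sketch below is illustrative only: the function names are hypothetical, the ground-state occupation g₀ is assumed to have the familiar BCS form ½[1 − ε/(ε² + ε₀²)^½], and the spin imbalance enters as a rigid energy shift n/N(0) of opposite sign for the two spin directions.

```python
import math


def g0(eps, gap=1.0):
    """Assumed BCS-like ground-state occupation at energy eps (gap = eps0)."""
    return 0.5 * (1.0 - eps / math.hypot(eps, gap))


def pair_weight_strict(eps, shift, gap=1.0):
    """g_down * (1 - g_up): the k = k' pair weight in the unsymmetrical form."""
    g_up = g0(eps - shift, gap)
    g_down = g0(eps + shift, gap)
    return g_down * (1.0 - g_up)


def pair_weight_symmetric(eps, shift, gap=1.0):
    """Geometric mean of g_down*(1 - g_up) and g_up*(1 - g_down)."""
    g_up = g0(eps - shift, gap)
    g_down = g0(eps + shift, gap)
    return math.sqrt(g_down * (1.0 - g_up) * g_up * (1.0 - g_down))


def change_from_ground_state(weight, eps, shift):
    """Difference between a pair weight at the given shift and at shift = 0."""
    return weight(eps, shift) - weight(eps, 0.0)


if __name__ == "__main__":
    for shift in (0.02, 0.01):
        print(shift,
              change_from_ground_state(pair_weight_strict, 0.5, shift),
              change_from_ground_state(pair_weight_symmetric, 0.5, shift))
```

Halving the shift roughly halves the change in the unsymmetrical weight but quarters the change in the geometric mean, which is the first-order versus second-order behaviour described in the text.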

We may now estimate the form of E(n) in a superconductor by use of
the following assumptions and approximations:

(1) Equation (5) and a constant interaction matrix element V (as in
BCS).

(2) g↑(ε) = g₀(ε − n/N(0)), g↓(ε) = g₀(ε + n/N(0)), where g₀(ε) is the function
(2.35) of BCS (1957 b).

(3) F(q) has a width in energy much less than that of g₀(ε), but H is
small enough so that βH is much less than the width of F(q). The first
part of this assumption is not justified if the spread of F(q) is of order
1/ξ₀, since 1/ξ₀ is comparable with the width of g₀(ε), but the assumption
represents a great arithmetical simplification in the evaluation of the
integrals.

The calculation is straightforward, and results in replacing Γ(ε) by

$$[\Gamma(\epsilon + n/N(0))\,\Gamma(\epsilon - n/N(0))]^{1/2} \qquad (6)$$

in eqn. (4) of BCS (1957 a), which now exhibits explicitly the symmetry
between up and down spins. Then on evaluation (1) takes the form

$$E(n) - E(0) = n^2/N(0) - 2n\beta H + \tfrac{1}{3}\, n^2/N(0),$$

the last term being the interaction term. Thus the coefficient of n² in
(1) is simply multiplied by 4/3, and correspondingly the spin susceptibility
and Knight shift are reduced to three-quarters of their normal value.
This is not dissimilar from Reif’s value of about two-thirds, and is certainly
within the combined limits of error of theory and experiment.
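The factor of three-quarters follows directly from minimizing the quadratic form. As a small check, under the assumption that the interaction term is written as c·n²/N(0) with c = 1/3 (the helper name is hypothetical):

```python
from fractions import Fraction


def susceptibility_ratio(interaction_coeff):
    """Ratio of superconducting to normal spin susceptibility.

    With E(n) - E(0) = (1 + c) * n**2 / N(0) - 2 * n * beta * H, the minimum
    lies at n = N(0) * beta * H / (1 + c), so the moment 2 * n * beta is
    reduced by the factor 1 / (1 + c) relative to the normal metal (c = 0).
    """
    return 1 / (1 + Fraction(interaction_coeff))


if __name__ == "__main__":
    print(susceptibility_ratio(Fraction(1, 3)))  # coefficient of n^2 multiplied by 4/3
```

Any other choice of average in place of the geometric mean would change c and hence the numerical ratio, but not the existence of a non-zero shift.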

By way of conclusion, we recapitulate briefly the salient features of our
discussion. We have assumed the general correctness of the BCS theory
of superconductivity but have made the assumption that the interaction
between the electrons is completely symmetric between up and down spins,
as illustrated by the approximate form (6) of the interaction energy. This
assumption, which we suggest is physically plausible, both implies and is
implied by a non-zero spin paramagnetism at 0°K and a Knight shift.
It means that there is no energy gap for turning a spin over, in contrast
to the energy gap for exciting an electron into a ‘normal’ state. However
the assumption is incompatible with the original form of the BCS theory
in which correlation is restricted purely to the states ψ_{k↑} and ψ_{−k↓} in
pairs, because the assumption would imply a probability in (3) being
greater than unity: but the incompatibility disappears if the correlation
is spread out slightly in k space. This spreading out is supported also
by independent physical and theoretical arguments, and there is no reason
to believe that it affects the existence of an energy gap for excitations to
‘normal’ states. Thus the characteristic properties derived from electrical
conduction (e.g. Meissner effect) may be expected to follow as in the BCS
theory except for quantitative corrections. These are probably minor
for thermodynamic properties but may be more serious for transport
effects, because the latter depend on matrix elements which are more
sensitive than energy values to the form of the wave function. We
emphasize again that our approach has been entirely heuristic, and in
particular we have not established that an actual wave function having
our proposed correlation exists. We hope however that this work points
out in what direction a theory of superconductivity should be sought,
which combines the successes of the BCS theory with an explanation of
the Knight shift.


REFERENCES 


BARDEEN, J., COOPER, L. N., and SCHRIEFFER, J. R., 1957 a, Phys. Rev., 106,
162; 1957 b, Ibid., 108, 1175.

CANTOR, A. J., and MARTIN, P. C., 1958, Bull. Amer. phys. Soc., 3, 202.

KNIGHT, W. D., ANDROES, G. M., and HAMMOND, R. H., 1956, Phys. Rev.,
104, 852.

REIF, F., 1957, Phys. Rev., 106, 208.

YOSIDA, K., 1958, Phys. Rev., 110, 769.




CORRESPONDENCE 
The Transient Conductivity Increase in Deformed Alkali Halides 


By A. TAYLOR and P. L. PRATT


Department of Physical Metallurgy, University of Birmingham 
[Received July 14, 1958] 


EARLY observations of the transient effects of plastic deformation upon
the ionic conductivity of alkali halide crystals were made by Gyulai and 
Hartly (1928) and by Stepanow (1933). In these experiments two 
different effects were observed, a deformation-induced charge-flow 
without an applied electric field, and a temporary increase in the ionic 
conductivity. The deformation-induced charge-flow has been studied 
more recently by Fischbach and Nowick (1955, 1958) and by Caffyn and 
Goodfellow (1955), and its interpretation in terms of charged dislocations 
has been discussed elsewhere (Pratt 1957). In this note we wish to 
comment only on the enhancement of the ionic conductivity produced 
by deformation. 

Seitz (1950, 1952) has interpreted this enhancement of ionic conductivity 
during compression in terms of the production of lattice vacancies by the 
intersection of moving dislocations; a deformation of 10% must have 
generated a density of vacancies of 10¹⁸/cm³ to account for the 100-fold
conductivity increase. Against this Fischbach and Nowick (1958) 
suggest that the increase is due to the break-up of associated divalent 
impurity-vacancy complexes, because the number of vacancies appears to 
them unreasonably large for generation by intersecting dislocations. 
Further they cite the finding of Caffyn and Goodfellow that the Gyulai— 
Hartly effect varies widely with the source of the crystal, and is not 
observed at all in some crystals. 

In our experiments on melt-grown crystals, we have found that tall
specimens of both NaCl and KCl, typically 1 cm tall by 0.4 cm by 0.4 cm,
show no conductivity increases during compression at 90°C up to strains
at which the crystals shatter; whereas squat specimens as used by the
other workers, typically 0.5 cm square in cross section and 0.2 cm tall,
show conductivity increases but only in the later stages of deformation.
Following the Russian studies of irrational twinning produced by the
compression of similar squat specimens (Brilliantow and Obreimow 1937,
Startzev 1940, Classen-Nekludowa 1943), we have examined in some
detail the inhomogeneous deformation which takes place. Slip occurs
independently on separate (110) [1̄10] slip systems in different parts of the
crystal, in such a way that the crystal is divided macroscopically into
blocks. In adjacent blocks slip occurs on systems which meet obliquely,
so that the boundaries between the blocks are (110) planes parallel to the
axis of compression, fig. 1. Mixed edge-screw dislocations from the two 
slip systems meet at the boundary, as Vaughan et al. (1958) have also 
recently observed. Viewed from the side of the compressed specimen, 
fig. 2, Pl. 66, these boundaries are readily seen. The dislocations retained 
in the boundary cause a rotation of the lattice on either side about the [Tay 
line of intersection of the two slip systems. This rotation gives rise to tilts 
on the side of the compressed plate, proportional in magnitude to the 
total strain, and this led to the Russian description of the effect as 
‘irrational twinning’. At higher stress levels, slip penetrates the 
boundaries between the blocks, fig. 3, Pl. 66, and wavy slip lines appear 
both near the boundary, and within the blocks, fig. 4, Pl. 66. Simul- 
taneously with this penetration of the boundaries, and not before, 
transient enhancement of the ionic conductivity is observed. 


Fig. 1 


Formation of blocks by slip on separate systems. Broken lines denote traces 
of slip planes in sides of specimen. 


These results appear to support the interpretation that point defects
generated by intersecting dislocations are responsible for the conductivity
increase. We hope to publish a fuller account of these experiments
shortly.


REFERENCES

BRILLIANTOW, N. A., and OBREIMOW, I. W., 1937, Phys. Z. Sowjet., 12, 7.

CAFFYN, J. E., and GOODFELLOW, T. L., 1955, Nature, Lond., 176, 878.

CLASSEN-NEKLUDOWA, M., 1943, J. Phys., Moscow, 7, 272.

FISCHBACH, D. B., and NOWICK, A. S., 1955, Phys. Rev., 99, 1333; 1958, J. phys. …

GYULAI, Z., and HARTLY, D., 1928, Z. Phys., 51, 378.

PRATT, P. L., 1957, Inst. Metals, Monograph and Report Series …

SEITZ, F., 1950, Phys. Rev., 80, 239; 1952, Advanc. Phys., 1, …

STARTZEV, V. T., 1940, J. Phys., Moscow, 3, 107.

STEPANOW, A. W., 1933, Phys. Z. Sowjet., 4, 609.

VAUGHAN, …, LEIVO, W. J., and SMOLUCHOWSKI, R., 1958, Phys. Rev., …




The Nuclear Magnetic Moment of Plutonium-239 


By J. BUTTERWORTH
Metallurgy Division, A.E.R.E., Harwell, Berks. 


[Received May 20, 1958] 


IT has been shown by optical spectroscopy (Van den Berg and Klinkenberg
1954) and electron paramagnetic resonance methods (Bleaney et al. 1954)
that the ²³⁹Pu nucleus has a spin of ½ and a magnetic moment. By these
methods, the best value obtained for the nuclear magnetic moment was
0.4 ± 0.2 nuclear magnetons. To determine the value of the moment
more accurately, the nuclear magnetic resonance of ²³⁹Pu was sought with
a nuclear magnetic induction spectrometer.

Because all the known salts of plutonium are paramagnetic, it seemed 
most likely that the nuclear magnetic resonance from such salts would be 
undetectable because of gross broadening. Consequently, the resonance 
was sought in plutonium metal where it is possible that the paramagnetic 
electrons are in a conduction band and thus their field at the nuclei is 
averaged by their motion. Another reason for seeking the resonance in 
the metal was that the shape of the resonance line could yield information 
about the unknown electronic structure of the metal. 

The sample consisted of just over 9 g of α-plutonium of which about
99% was ²³⁹Pu, the rest being mainly ²⁴⁰Pu and *Pu. Other elements,
present as dissolved impurities, included some paramagnetic atoms, but
these amounted in number to less than 0.02% of the sample, too small an
amount to broaden the resonance significantly. The sample was a powder
of particle diameter less than 50 μ, to allow the applied radio-frequency
field to penetrate it.

The resonance was sought in a direct magnetic field of 4200 gauss at
room temperature. To get as large a signal to noise ratio as possible, the
dispersion mode of the resonance was sought with the relatively large
rotating radio-frequency field of 0.5 gauss, and the bandwidth of the
spectrometer was reduced to 1/64 c/s. It was estimated that the signal to
noise ratio of the spectrometer was sufficient to allow the resonance of
²³⁹Pu to be observed even if the width of the line were as great as 30 gauss.

The search covered the whole range predicted by the electron para- 
magnetic resonance results, namely 0-2 to 0-6 nuclear magnetons. No 
resonance was detected. 
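The frequency range implied by this search can be estimated from the usual resonance condition ν = μB/(Ih) for a nucleus of spin I and moment μ. The sketch below is an illustrative calculation only, not a description of the spectrometer: the function name is hypothetical, the physical constants are modern values, and any Knight shift of the metal is ignored.

```python
NUCLEAR_MAGNETON = 5.0507837e-27  # J/T
PLANCK = 6.62607015e-34           # J s
GAUSS_TO_TESLA = 1.0e-4


def resonance_frequency_hz(moment_nm, spin, field_gauss):
    """Nuclear resonance frequency for a moment given in nuclear magnetons."""
    field_tesla = field_gauss * GAUSS_TO_TESLA
    return moment_nm * NUCLEAR_MAGNETON * field_tesla / (spin * PLANCK)


if __name__ == "__main__":
    field = 4200.0  # gauss, as in the text
    for moment in (0.2, 0.6, 0.02):
        mhz = resonance_frequency_hz(moment, 0.5, field) / 1.0e6
        print(f"moment {moment} n.m. -> {mhz:.3f} Mc/s")
```

On these assumptions the range 0.2 to 0.6 nuclear magnetons corresponds to roughly 1.3 to 3.8 Mc/s at 4200 gauss, while a moment near 0.02 nuclear magnetons would resonate an order of magnitude lower, outside the range searched.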

The absence of the resonance indicates that either the value of the 
nuclear magnetic moment lies outside the range searched or the resonance 
line has been broadened out of recognition. Since the present work was 
completed experimental evidence has been produced that supports both 
indications. From the results of an atomic beam resonance experiment 
there has been deduced a value for the nuclear moment of ²³⁹Pu of about 
0.02 nuclear magnetons (Hubbs et al. 1958) while preliminary measurements 
of the magnetic susceptibility of α-plutonium (Olsen, private 
communication) appear to give evidence, in the form of a strong negative 
temperature dependence of the susceptibility and a magnetic transition 
at a low temperature, for the existence in α-plutonium of localized unpaired 
electrons. Such electrons would almost certainly cause the obliteration 
of the nuclear magnetic resonance. 

Apart from localized electrons, a number of line-broadening mechanisms 
are known to be present in α-plutonium, namely, the direct nuclear 
spin-spin interaction, the nuclear spin-lattice relaxation, the anisotropy of 
the Knight shift, and the indirect nuclear spin-spin interaction. The extent 
to which each contributes to the line width cannot be calculated accurately 
without a more detailed knowledge of the wave functions of the conduction 
electrons than is available. However, a rough but pessimistic estimate 
indicates that even their combined effect is not enough to account for the 
absence of the line. Consequently, the absence of the line, if due to gross 
broadening, must be attributed to localized unpaired electrons. 

The present situation, therefore, is that the value of the nuclear magnetic 
moment of ²³⁹Pu is known with little certainty, and there seems little 
chance of finding it by nuclear magnetic resonance methods in α-plutonium. 
The chances of finding the resonance would be greatly increased if a 
diamagnetic salt of plutonium could be found and used as a sample. 
Alternatively, if one of the high temperature phases of plutonium were 
found, the magnetic susceptibility of which was relatively temperature 
independent, then, provided such a phase could be stabilized at a lower 
temperature, it would be possible to carry out a nuclear resonance experi- 
ment with a better signal to noise ratio and more conveniently than at the 
higher temperature. If the value of the moment is as low as is indicated 
by the atomic beam experiment, namely 0.02 nuclear magnetons, then it 
will be necessary to achieve sample temperatures of the order of 20°K. 


ACKNOWLEDGMENTS 
The author is indebted to Dr. H. M. Finniston for suggesting this work, 
to him and Mr. A. D. Le Claire for their interest and encouragement, 
and to Dr. L. E. Drain for helpful discussions. Thanks are also due to 
Mr. M. J. F. Notley who prepared the sample. 


REFERENCES 
BLEANEY, B., LLEWELLYN, P. M., PRYCE, M. H. L., and HALL, G. R., 1954, 
Phil. Mag., 45, 991. 
HUBBS, J. C., MARRUS, R., NIERENBERG, W. A., and WORCESTER, J. L., 1958, 
Phys. Rev., 109, 390. 
VAN DEN BERG, M., and KLINKENBERG, P. F. A., 1954, Physica, 20, 461. 




REVIEWS OF BOOKS 


Progress in Elementary Particle and Cosmic Ray Physics—IV. By J. G. WILSON 
and S. A. WOUTHUYSEN. (Amsterdam: North-Holland Publishing Company.) 
[Pp. 470.] 45 guilders. 

THIS volume is the fourth in a series in which the first three were entitled 
Progress in Cosmic Ray Physics. It contains three articles on Elementary 
Particles and two on Cosmic Rays. With the development of particle accelera- 
tors in the Bev range a new generation of physicists has grown up to whom the 
properties of new particles and the astrophysical or geophysical aspects of cosmic 
rays would appear to make strange bedfellows. It seems likely that the greater 
rate of growth of work on new particles will leave its companion a smaller share 
in future, which may give further incentive to the issue of separate volumes on 
the two fields. 

The Primary Cosmic Radiation and its Time Variations is described by S. F. 
Singer in a very readable article which contains some impressive diagrams 
including a few in the Picasso tradition. The origin of Cosmic Radiation is 
discussed by V. L. Ginzburg, who describes present theories and makes useful 
suggestions for further experimental work in both cosmic ray physics and radio 
astronomy. 

Some Theoretical Aspects of the Strong Interactions of the New Particles are 
described by B. d’Espagnat and J. Prentki, who deal particularly with the 
possibility of classification by means of three or four dimensional iso-spin 
space. The “Properties and Production of K-mesons” by W. D. Walker 
describes experimental work up to about mid-1957, and its interpretation before 
the non-conservation of parity had become a familiar concept (though this is 
just mentioned). The theoretical and experimental aspects of the Interaction of 
μ-Mesons with Matter are surveyed by G. N. Fowler and A. W. Wolfendale. 

R. J. E. 


Mechanical Resolution of Linguistic Problems. By A. D. BOOTH, L. BRANDWOOD 
and J. P. CLEAVE. (London: Butterworths Scientific Publications; New York: 
Academic Press Inc.) [Pp. vii+306.] 50s.; $9.80. 


THIS book gives an account of some of the results obtained at Birkbeck College 
Computational Laboratory on the application of the digital computer APEXC 
to linguistic problems. A short historical account of mechanical translation 
from 1947 to the present day is included. 

The book proceeds to deal cursorily with such matters as machine methods for 
making word counts and concordances (apparently the Jesuits are using an 
I.B.M. computer in the preparation of a concordance to the works of Aquinas) 
and computer methods for the solution of problems of stylistic analysis (with 
special reference to the chronological dating of Plato’s ‘Dialogues’). By far the 
greatest section of the book (Chapter 9), however, is devoted to what the authors 
term “a conglomeration of the main problems to be faced in translating 
German together with one or two suggestions on how to overcome them”. 
This chapter takes, as its basis, the work already done by Oswald and Fletcher 
on the mechanical resolution of German syntax patterns and endeavours to 
extend it. 

Many of the problems arising from German grammar and syntax are discussed 
at length and documented with copious examples. Indeed the whole chapter 
well illustrates the principle that programmers and linguists must work in the 
closest collaboration in the MT field—never in isolation. No reference is made to 
detailed programming, this being left for a companion volume. 

The final chapter of the book presents an assessment of the special features 
of a machine built for translation. (A proposed instruction code for such a 
machine is also given.) It is concluded that the construction of such a machine 
would require an expenditure of between £50,000 and £100,000. 

While the book under review rightly highlights some of the problems involved 
in MT (more especially in the case of German and, to some lesser extent, French), 
it cannot be said that the ad hoc solutions here proposed for these particular 
languages will greatly assist in solving the general problems. One is left with 
the feeling that a more general approach is necessary. M. M. 


Observation and Interpretation—A Symposium of Philosophers and Physicists. 
Edited by S. KÖRNER. (London: Butterworths Scientific Publications.) 
[Pp. 218.] 40s. 

THIS is a discussion of questions belonging to the philosophy of quantum 
mechanics and borderline problems connected therewith, being a reprint of the 
papers given together with a verbatim report of the discussions that followed. 
Two of the seven sections explicitly deal with problems surrounding the con- 
ception of probability in its relation to quantum mechanics. One section 
includes Bohm’s attempt at a formulation of quantum theory in terms of hidden 
variables at a sub-quantum-mechanical level. Relating both this and the 
former sections is the question whether probability is an ultimate property of 
a sequence (alternatively, according to Popper, of the experimental arrange- 
ment that gives rise to it) or whether the events related by a probability law 
should be viewed as related by causal laws—at a ‘deeper level’ (Bohm). And 
this point again connects with another, whether a correct theory of measurement 
should involve a ‘realistic interpretation of the formalism of quantum 
mechanics’ (Feyerabend). In addition to these more technical papers there 
are others of a more purely ‘philosophical’ nature. Often these fit easily 
into the general scheme; thus the question of the position of the ‘observer’, 
putatively involved in the definition of physical concepts, crops up in the crucial 
arguments about the foundations of quantum theory (Rosenfeld), the papers 
on the theory of measurement, but also in those dealing with ‘pure’ questions 
of prediction and determinism. 

Participants include Ayer, Bohm, Braithwaite, Popper, Ryle, Rosenfeld, 
Vigier, Suessmann, Bopp, Groenewold, just to mention some of the 40 odd 
speakers. The book emphasizes the importance of trying to get clarity on 
fundamental concepts; above all, it shows physicists and philosophers dis- 
puting—thus exhibiting the very nerve of philosophical activity. It makes 
fascinating reading and should be of equal interest to both physicists and 
philosophers. Gels 


[The Editors do not hold themselves responsible for the views 
expressed by their correspondents.] 


D. R. BRAME and T. EVANS Phil. Mag. Ser. 8, Vol. 3, Pl. 60. 


Fig. 4 


Oriented gold film deformed on a single crystal silver substrate. The thinning 
of the film due to slipping is shown in two directions. x 16 000. 

Dislocation configuration in oriented gold film after deformation on a single 
crystal silver substrate. Slip on a second system is also evident. x 80 000. 

Fig. 5 


Slip lines in an oriented gold-palladium alloy film deformed on a single crystal 
silver substrate. x 16 000. 


D. R. BRAME and T. EVANS Phil. Mag. Ser. 8, Vol. 3, Pl. 61. 


Fig. 7 


Slip lines and cracking in an oriented platinum film on a single crystal silver 
substrate. x 16 000. 

Region of duplex slip in an oriented platinum film deformed on a single crystal 
silver substrate. x 16 000. 

Cracking and slipping of an oriented rhodium film deformed on a single crystal 
silver substrate. x 16 000. 

Cracking of oriented rhodium film deformed on a single crystal silver substrate. 
x 16 000. 


D. R. BRAME and T. EVANS Phil. Mag. Ser. 8, Vol. 3, Pl. 62. 


Fig. 13 


Oriented gold film which cracked when deformed on a substrate due to an 
underlying rhodium film acting as a barrier to dislocations. x 16 000. 

Group of dislocations in an oriented gold film deformed on a single crystal 
silver substrate. A rhodium film acted as a barrier on the other side of the 
silver substrate. x 80 000. 


Fig. 16 


Another group of dislocations in the same specimen as fig. 13. x 80 000. 

Cracking of polycrystalline rhodium film in one direction. x 16 000. 


G. A. BASSETT Phil. Mag. Ser. 8, Vol. 3, Pl. 63. 


Cleavage steps on cleavage surface of sodium chloride, decorated with gold. 
x 32 000. Insert x 200 000. 



G. A. BASSETT Phil. Mag. Ser. 8, Vol. 3, Pl. 64. 


Slip step on sodium chloride deflected on crossing cleavage step. x 140 000. 


Cross-slip in sodium chloride. x 200 000. 


G. A. BASSETT Phil. Mag. Ser. 8, Vol. 3, Pl. 65. 


Fig. 4 


Cleavage and slip steps on sodium chloride near a light scratch on the crystal 
surface. x 120 000. 


A. TAYLOR and P. L. PRATT Phil. Mag. Ser. 8, Vol. 3, Pl. 66. 


Fig. 2 Fig. 3 


Block boundary before intersection. x 80. 

Block boundary after intersection, showing wavy slip. x 80. 


Wavy slip away from boundary. x 500. 


